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The following constitutes the Appellants 1 Brief on Appeal. 

I. REAL PARTY IN INTEREST 

The real party in interest is Genentech, Inc., South San Francisco, California, by an 
assignment of the parent application, U.S. Serial No. 09/941,992 recorded November 16, 2001, 
at Reel 012176 and Frame 0450. 

IL RELATED APPEALS AND INTERFERENCES 

The claims pending in the current application are directed to a polypeptide referred to 
herein as "PROl 153". There exist two related patent applications, (1) U.S. Serial No. 
09/997,683, filed November 15, 2001 (containing claims directed to antibodies to the PROl 153 
polypeptide), and (2) U.S. Serial No. 09/997,440, filed November 15, 2001 (containing claims 
directed to PROl 153 polypeptides). These two related applications are pending and are being 
examined by the same Examiner. 

III. STATUS OF CLAIMS 
Claims 124, 129-131 and 135-145 are in this application. 
Claims 1-123, 125-128 and 132-134 have been canceled. 

Claims 124, 129-131 and 135-145 stand rejected and Appellants appeal the rejection of 
these claims. 

A copy of the rejected claims in the present Appeal is provided in the Claims Appendix. 

IV. STATUS OF AMENDMENTS 

In an Amendment filed on December 23, 2004 after the mailing of the Final Office of 
October 1, 2004, Claims 1 19-123 were canceled and Claims 124 and 139 were amended to more 
clearly claim what the Appellants always believed was the claimed subject matter in the present 
application. The Advisory Action mailed April 27, 2005 indicated that these amendments were 
entered for purposes of this appeal. Hence, Claims 124, 129-131 and 135-145 are currently 
pending and are under Appeal in this case, and, the rejection under 35 U.S.C. §112, first 
paragraph to Claims 1 19-123 for not satisfying the written description requirement are moot. 
The outstanding rejections are addressed with respect to these pending claims. 
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Further, in an Amendment filed on August 25, 2005 after the mailing of the Final Office 
of October 1, 2004, a declaration by Audrey Goddard, Ph.D. was filed. Entry of this declaration, 
was indicated as acceptable in a telephone conference with Examiner Wegert on August 24, 
2005 for the purposes of this appeal. 

V. SUMMARY OF CLAIMED SUBJECT MATTER 

Independent Claim 124 is directed to an isolated nucleic acid of SEQ ED NO: 350; the 
full-length coding sequence of the nucleic acid sequence of SEQ ID NO: 350; or the full-length 
coding sequence of the cDNA deposited under ATCC accession number 209982, and encodes 
for the PRO 1 153 polypeptide. The PROl 153 gene was shown for the first time to be 
significantly amplified in human lung adenocarcinomas or squamous cell carcinomas as 
compared to normal, non-cancerous human tissue controls (2.014 to 2.87 fold in two different 
lung primary tumors). This is set forth in the specification, at least in the 'Gene Amplification 
assay, 1 Example 170, page 539, line 19, to page 555, line 5 (specifically, see Table 9B, page 
551). The profiles of various primary lung tumors used for screening the PRO polypeptide 
compounds of the invention in the gene amplification assay are summarized on Table 8, page 
546 of the specification. This feature is carried by all claims dependent directly or indirectly 
from Claim 124, namely, Claims 129-138. Methods for selecting a host are generally set forth in 
the specification at, for example, in Examples 140-143 and page 376, line 12 onwards (Claims 
137-138), and describes the expression of PRO nucleic acids in various host cells, including E. 
coli y yeast and Baculovirus-infected insect cells. Methods for selecting a vector are generally set 
forth in the specification at, for example, on page 378, line 8 (Claims 135 and 136). 

Independent Claim 139 is directed to an isolated nucleic acid consisting of at least 30 
nucleotide fragment of the nucleic acid sequence of SEQ ID NO: 350, or a complement thereof, 
that specifically hybridizes under highly stringent conditions to (a) the nucleic acid sequence of 
SEQ ID NO: 350 or a complement thereof; or (b) the full-length coding sequence of the cDNA 
deposited under ATCC accession number 209982 or a complement thereof and wherein the 
isolated nucleic acid molecule is suitable for use as a PCR primer or probe. Methods for using 
high stringent hybridization conditions recited in Claim 139 are provided, for example, on page 
312, line 33 onwards. Pending Claims 140-145 depend from Claim 139. Methods for using 
polynucleotides encoding PRO as hybridization probes consisting of nucleotide fragments of 
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varying lengths is described at, for example, on page 285, line 11 onwards, and describes probes 
of at least 50 nucleotides (Claim 140), 60 nucleotides (Claim 141), 70 nucleotides (Claim 142), 
80 nucleotides (Claim 143), 90 nucleotides (Claim 144) and 100 nucleotides (Claim 145). 

Finally, the amino acid sequence of the native "PROl 153" polypeptide and the nucleic 
acid sequence encoding this polypeptide (referred to in the present application as "DNA59842- 
1502") are shown in the present specification as SEQ ID NOs: 351 and 350, respectively, and in 
Figures 246 and 245, respectively. Page 299, lines 34-37 of the specification provides the 
description for Figures 246 and 245. The isolation of cDNA clones encoding PROl 153 of SEQ 
ED NO:350 is described in Example 108, page 490 of the specification. The specification 
discloses that various portions of the encoded PROl 153 polypeptide is very proline rich ad has 
two transmembrane domains (see, for example, page 220-222, line 30 onwards). 

VL GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

1. Whether Claims 124, 129-131 and 135-145 should be accorded priority of 
provisional Application 60/141,037, filed 19 November, 2001. 

2. Whether Claims 124, 129-131 and 135-145 satisfy the utility/ enablement 
requirement under 35 U.S.C. §101/1 12, first paragraph. 

VII. ARGUMENTS 

Summary of the Arguments 
Issue 1 : Priority 

The instant application has not been granted the earlier priority date on the grounds that 
"although disclosing the same experimental assays as the instant specification, do not enable the 
instant invention and therefore do not impart Utility. 

Appellants submit that data derived from the Gene Amplification assay was first 
disclosed in U. S. Application Serial No. 60/141,037, filed 19 November, 2001 for the claimed 
PROl 153 encoding gene. Appellants further submit that, the same detailed reasons discussed 
below under the section on Issue II: Utility/ Enablement, are sufficient to also establish 
patentable utility for U. S. Application Serial No. 60/141,037. Hence, Appellants should be able 
to rely upon this provisional application to provide an effective filing date of 19 November, 2001 
for the instant application. 
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Issue 2: Utility/ Enablement 

Claims 124, 129-131 and 135-145 stand rejected under 35 U.S.C. §101/ 1 12, first 
paragraph as allegedly lacking either a specific and substantial asserted utility or a well 
established utility. 

Patentable utility for the PROl 153 polypeptides is based upon the gene amplification 
data for the gene encoding the PROl 153 polypeptide. The specification discloses that the gene 
encoding PROl 153 showed significant amplification, ranging from 2.014 to 2.87 -fold in two 
different lung primary tumors. The Declaration of Dr. Audrey Goddard, submitted with 
Appellants' Response filed August 25, 2005, explains that a gene identified as being amplified at 
least 2-fold by the disclosed gene amplification assay in a tumor sample relative to a normal 
sample is useful as a marker for the diagnosis of cancer , and for monitoring cancer development 
and/or for measuring the efficacy of cancer therapy. Therefore, the gene amplification levels of 
2.014 to 2.87 -fold for lung primary tumors is not a "small increase" as the Examiner contends 
(page 4 of Final Office action dated October 1, 2004). Appellants also submit that, as any skilled 
artisan in the field of oncology would easily appreciate, not all tumor markers are generally 
associated with every tumor, or even with most tumors . Therefore, whether the PROl i 53 gene 
is amplified in few tumor samples or in the vast majority of tumor samples studied is not relevant 
to its identification as a tumor marker, or its patentable utility. Rather, the fact that the 
amplification data for PROl 153 is considered significant is what lends support to its usefulness 
as a tumor marker. Thus, a positive result does indicate the presence of cancer, while a negative 
result requires further follow up testing, testing which is considered routine by one skilled in the 
art of oncology and is not considered undue. 

The Examiner further asserted on page 5 of the Final Office Action mailed October 1, 
2004 that amplification of the PROl 153, polynucleotide does not impart a specific, substantial, 
and credible utility to the PROl 153 polypeptide since, "there is no evidence regarding whether 
or not PROl 153 mRNA or polypeptide levels are also increased in this cancer." In support of 
this assertion, the Examiner cited references by Pennica et al, Haynes et al and Hu et al 

First of all, the claims are directed to nucleic acids , not polypeptides, therefore, the issue 
of whether there is a correlation between gene amplification and polypeptide expression levels is 
irrelevant. One of skill in the art would understand how to use the claimed nucleic acids to 
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detect amplification of the gene encoding PROl 153, and how to use the gene amplification 
results to diagnose cancer. Thus the question of whether or not PROl 153 mRNA or polypeptide 
levels are also increased in these cancers has no relevance to the utility of the claimed nucleic 
acid molecules. Moreover, the teachings of Pennica et al., Haynes et al or Hu et al, do not 
conclusively establish a prima facie case for lack of utility. 

Instead, Appellants submit that based on the gene amplification data and the substantial, 
credible, asserted utility of the PROl 153 gene in the diagnosis of lung cancer, one of ordinary 
skill would know exactly how to make and use these claimed nucleic acids for the diagnosis of 
cancers, without any undue experimentation. 

Response to Rejections 

ISSUE 1, U.S. Provisional Application No, 60/141,037 Satisfies the Utility Requirement of 
35 U.S.C. § 101/ § 112, First Paragraph based on the results of the Gene Amplification assay 

Appellants have asserted that U.S. Provisional Application No. 60/141,037, filed 
November 19, 2001, discloses the gene amplification assay (shown in Example 170 of the instant 
specification) and establishes patentable utility for the claimed PROl 153 polypeptides. 

Appellants submit, for the reasons set forth below under Issue 2 for Utility/ Enablement, 
that the results of the gene amplification assay disclosed in the specification of U.S. Application 
No. 60/141,037, provides at least one credible, substantial and specific asserted utility for the 
claimed PROl 153 polypeptides under 35 U.S.C. §101/§1 12, first paragraph. Accordingly, 
Appellants respectfully request that the subject matter of the instant claims be granted the 
November 19, 2001, priority date of U.S. Provisional Application No. 60/141,037. 

ISSUE 2. Claims 124, 129-131 and 135-145 are supported by a credible, specific and 
substantial asserted utility, and thus meet the utility requirement of 35 U.S.C. § 101/ 112, 
first paragraph 

The sole basis for the Examiner's rejection of Claims 124, 129-131 and 135-145 under 
this section is that the data presented in Example 170 of the present specification is allegedly 
insufficient under the present legal standards to establish a patentable utility under 35 U.S.C. § 
101 for the presently claimed subject matter. 
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Claims 124, 129-131 and 135-145 stand further rejected under 35 U.S.C. §112, first 
paragraph, allegedly "since the claimed invention is not supported by either a specific and 
substantial asserted utility or a well established utility for the reasons set forth above, one skilled in 
the art clearly would not know how to use the claimed invention." 

Appellants strongly disagree and, therefore, respectfully traverse the rejection. 

A. The Legal Standard For Utility Under 35 U.S.C. § 101 

According to 35 U.S.C. § 101: 

Whoever invents or discovers any new and useful process, machine, manufacture, or 
composition of matter, or any new and useful improvement thereof, may obtain a patent 
therefor, subject to the conditions and requirements of this title. (Emphasis added.) 

In interpreting the utility requirement, in Brenner v. Manson the Supreme Court held 
that the quid pro quo contemplated by the U.S. Constitution between the public interest and the 
interest of the inventors required that a patent applicant disclose a "substantial utility" for his or 

2 

her invention, i.e! a utility "where specific benefit exists in currently available form." The Court 
concluded that "a patent is not a hunting license. It is not a reward for the search, but 
compensation for its successful conclusion. A patent system must be related to the world of 

3 

commerce rather than the realm of philosophy." 

4 

Later, in Nelson v. Bowler the C.C.P.A. acknowledged that tests evidencing 
pharmacological activity of a compound may establish practical utility, even though they may 
not establish a specific therapeutic use. The court held that "since it is crucial to provide 
researchers with an incentive to disclose pharmaceutical activities in as many compounds as 
possible, we conclude adequate proof of any such activity constitutes a showing of practical 

utility." 5 

1 Brenner v. Manson, 383 U.S. 519, 148 U.S.P.Q. (BNA) 689 (1966). 

2 Id at 534, 148 U.S.P.Q. (BNA) at 695. 

3 Id at 536, 148 U.S.P.Q. (BNA) at 696. 

4 Nelson v. Bowler, 626 F.2d 853, 206 U.S.P.Q. (BNA) 881 (C.C.P.A. 1980). 

5 Id at 856, 206 U.S.P.Q. (BNA) at 883. 
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In Cross v. Iizuka the C.A.F.C. reaffirmed Nelson, and added that in vitro results might 
be sufficient to support practical utility, explaining that "in vitro testing, in general, is relatively 
less complex, less time consuming, and less expensive than in vivo testing. Moreover, in vitro 
results with the particular pharmacological activity are generally predictive of in vivo test results, 

i.e. there is a reasonable correlation there between." 7 The court perceived "No insurmountable 
difficulty" in finding that, under appropriate circumstances, "in vitro testing, may establish a 

8 

practical utility." 

The case law has also clearly established that Appellants 1 statements of utility are usually 

9 

sufficient, unless such statement of utility is unbelievable on its face. The PTO has the initial 

burden to prove that Appellants' claims of usefulness are not believable on their face. In 
general, an Applicant's assertion of utility creates a presumption of utility that will be sufficient 
to satisfy the utility requirement of 35 U.S.C. §101, "unless there is a reason for one skilled in 

11 12 

the art to question the objective truth of the statement of utility or its scope." ' 

Compliance with 35 U.S.C. §101 is a question of fact. 13 The evidentiary standard to be 
used throughout ex parte examination in setting forth a rejection is a preponderance of the 



6 Cross v. Iizuka, 753 F.2d 1047, 224 U.S.P.Q. (BNA) 739 (Fed. Cir. 1985). 

7 Id. at 1050, 224 U.S.P.Q. (BNA) at 747. 
* Id. 

9 In re Gazave, 379 F.2d 973, 154 U.S.P.Q. (BNA) 92 (C.C.P.A. 1967). 
1° Ibid. 

11 In reLanger, 503 F.2d 1380,1391, 183 U.S.P.Q. (BNA) 288, 297 (C.C.P.A. 1974). 

12 See also In reJolles, 628 F.2d 1322, 206 USPQ 885 (C.C.P.A. 1980); In re Irons, 340 
F.2d 974, 144 USPQ 351 (1965); In reSichert, 566 F.2d 1 154, 1 159, 196 USPQ 209, 212-13 
(C.C.P.A. 1977). 

13 Raytheon v. Roper, 724 F.2d 951, 956, 220 U.S.P.Q. (BNA) 592, 596 (Fed. Cir. 1983) 
cert, denied, 469 US 835 (1984). 
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totality of the evidence under consideration. Thus, to overcome the presumption of truth that 
an assertion of utility by the applicant enjoys, the Examiner must establish that it is more likely 
than not that one of ordinary skill , in the art would doubt the truth of the statement of utility. 
Only after the Examiner made a proper prima facie showing of lack of utility, does the burden of 
rebuttal shift to the applicant. The issue will then be decided on the totality of evidence. 

The well established case law is clearly reflected in the Utility Examination Guidelines 

("Utility Guidelines") 15 , which acknowledge that an invention complies with the utility 
requirement of 35 U.S.C. §101, if it has at least one asserted "specific, substantial, and credible 
utility" or a "well-established utility." Under the Utility Guidelines, a utility is "specific" when 
it is particular to the subject matter claimed. For example, it is generally not enough to state that 
a nucleic acid is useful as a diagnostic without also identifying the conditions that are to be 
diagnosed. 

In explaining the "substantial utility" standard, M.P.E.P. §2107.01 cautions, however, 
that Office personnel must be careful not to interpret the phrase "immediate benefit to the 
public" or similar formulations used in certain court decisions to mean that products or services 
based on the claimed invention must be "currently available" to the public in order to satisfy the 
utility requirement. "Rather, any reasonable use that an applicant has identified for the invention 
that can be viewed as providing a public benefit should be accepted as sufficient, at least with 

regard to defining a 'substantial' utility." 16 Indeed, the Guidelines for Examination of 
Applications for Compliance With the Utility Requirement, 17 gives the following instruction to 
patent examiners: "If the applicant has asserted that the claimed invention is useful for any 
particular practical purpose . . . and the assertion would be considered credible by a person of 
ordinary skill in the art, do not impose a rejection based on lack of utility." 



14 In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d (BNA) 1443, 1444 (Fed. Cir. 

1992). 

15 66 Fed. Reg. 1092 (2001). 

16 M.P.E.P. §2107.01. 

17 M.P.E.P. §2107 II (B)(1). 
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B. Proper Application of the Legal Standard 

Appellants respectfully submit that the data presented in Example 170 starting on page 
539 of the specification of the specification and the cumulative evidence of record, which 
underlies the current dispute, indeed support a "specific, substantial and credible" asserted utility 
for the presently claimed invention. 

Example 170 describes the results obtained using a very well-known and routinely 
employed polymerase chain reaction (PCR)-based assay, the TaqMan™ PCR assay, also referred 
to herein as the gene amplification assay. This assay allows one to quantitatively measure the 
level of gene amplification in a given sample, say, a tumor extract, or a cell line. It was well 
known in the art at the time the invention was made that gene amplification is an essential 
mechanism for oncogene activation. Appellants isolated genomic DNA from a variety of primary 
cancers and cancer cell lines that are listed in Table 9 (pages 539 onwards of the specification), 
including primary lung and colon cancers of the type and stage indicated in Table 8 (page 546). 
The tumor samples were tested in triplicates with Taqman™ primers and with internal controls, 
beta-actin and GADPH in order to quantitatively compare DNA levels between samples (page 
548, lines 33-34). As a negative control, DNA was isolated from the cells often normal healthy 
individuals, which was pooled and used as a control (page 539, lines 27-29) and also, no- 
template controls (page 548, lines 33-34). The results of TaqMan™ PCR are reported in ACt 
units, as explained in the passage on page 539, lines 37-39. One unit corresponds to one PCR 
cycle or approximately a 2-fold amplification, relative to control, two units correspond to 4-fold, 
3 units to 8-fold amplification and so on . Using this PCR-based assay, Appellants showed that 
the gene encoding for PROl 153 was amplified, that is, it showed approximately 1.01- 1.52 ACt 
units for lung tumors which corresponds to 2 1,01 -2 1 52 - fold amplification in lung tumors, or 
2.014 to 2.87 -fold in two different lung primary tumors. 

Previously, in the Office Action mailed March 23, 2004 and on page 6 of the Final Office 
action mailed October 1, 2004, the Examiner stated based on Hittelman et al that "an increase in 
chromosome number is a common occurrence in cancerous cells and would result in a positive 
ACt measurement in the instant Specification." Appellants partly agree and disagree with this 
statement. 

Hittelman studied premalignant lesions and suggests that epithelial tumors develop 
through a multistep process driven by genetic instability (see abstract). Hittelman showed that a 
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subset of the same molecular changes found in associated tumor were also found in premalignant 

lesions, suggesting that these premalignant lesions might represent precursor lesions for 

associated tumors, i.e., a manifestation of a multistep tumorigenesis process. (See Hittelman, 

page 4, last three lines). Appellants therefore submit that, contrary to the Examiner's rejection, 

the Hittelman reference strongly supports the Appellants position that there is utility in 

identifying genetic biomarkers in epithelial tissues at cancer risk (also see Hittelman, abstract, 

line 4-7). Hittelman adds on page 2, fourth paragraph, line 3 that "it is important to identify 

individuals at significantly increased cancer risk who might best benefit from different types of 

intervention". Taken together, even if Appellants were to show that the observed PROl 153 gene 

amplification were due to chromosomal aneuploidy (which Appellants do not contend to), 

identifying genetic biomarkers like the PROl 153 gene with this aneuploidy is a very important 

and useful step, according to Hittelman, in identifying individuals at significantly increased 

cancer risk. Therefore, Hittelman supports at least one utility for the PROl 153 gene, that is, as a 

genetic biomarker for cancer or precancerous cells. As one skilled in the art would clearly know, 

early detection of lung cancer provides information in advance about risk assessment, prognosis 

and therapy for lung cancer. 

. Further, the Examiner acknowledges that there was a "increase" in DNA, but stated that 

the increase was "very small" (Office action mailed March 23, 2004). Regarding this rejection 

Appellants submit that the Examiner seems to be applying a heightened utility standard in this 

instance, which is legally incorrect. Instead, whether this "increase" is "significant" needs to be 

addressed. As evidence that the "increase in DNA" in the gene amplification assay is significant, 

Appellants submitted a Declaration by Dr. Audrey Goddard in the Supplemental response filed 

August 25, 2005. The Declaration by Dr. Audrey Goddard provides a statement by an expert in 

the relevant art stating that "fold amplification" values of at least 2-fold are considered 

significant in the TaqMan™ PCR gene amplification assay. Appellants particularly draw the 

Board's attention to page 3 of the Goddard Declaration which clearly states that: 

It is further my considered scientific opinion that an at least 2-fold increase in gene copy 
number in a tumor tissue sample relative to a normal (z.e., non-tumor) sample is 
significant and useful in that the detected increase in gene copy number in the tumor 
sample relative to the normal sample serves as a basis for using relative gene copy 
number as quantitated by the TaqMan PCR technique as a diagnostic marker for the 
presence or absence of tumor in a tissue sample of unknown pathology. Accordingly, a 
gene identified as being amplified at least 2-fold by the quantitative TaqMan PCR assay 
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in a tumor sample relative to a normal sample is useful as a marker for the diagnosis of 
cancer, for monitoring cancer development and/or for measuring the efficacy of cancer 
therapy. (Emphasis added). 

Accordingly, the 2.014 to 2.87 -fold in two different lung primary tumors would be 
considered significant and credible by one skilled in the art, based upon the facts disclosed in the 
Goddard Declaration. 

Further Appellants submit that the fact that two lung tumor samples and three colon 
tumor samples tested positive in this study does not make the gene amplification data, by any 
means, less significant or spurious. As any skilled artisan in the field of oncology would easily 
appreciate, not all tumor markers are generally associated with every tumor, or even, with most 
tumors. In fact, some tumor markers are useful for identifying rare malignancies . That is, the 
association of the tumor marker with a particular type of tumor lesion may be rare, or, the 
occurrence of that particular kind of tumor lesion itself may be rare. In either event, even these 
rare tumor markers, which may not give a positive hit with most common tumors, have great 
value in tumor diagnosis, and consequently, in tumor prognosis . The skilled artisan would know 
that such tumor markers are very useful for better classification of tumors. Therefore, whether 
the PRO 1 153 gene is amplified in two lung tumors or in most lung tumors is not relevant to its 
identification as a tumor marker, or its patentable utility. Rather, whether the amplification data 
for PRO 1 153 is significant is what lends support to its usefulness as a tumor marker. It was well 
known in the art at the time of filing of the application that gene amplification, which occurs in 
most solid tumors like lung cancers, is generally associated with poor prognosis. Therefore, the 
PRO 1 153 gene becomes an important diagnostic marker to identify such malignant lung cancers, 
even if the malignancy associated with PROl 153 molecule is a rare occurrence . Accordingly, the 
present specification clearly discloses enough evidence that the gene encoding the PROl 153 
polypeptide is significantly amplified in certain types of lung tumors and is therefore, a valuable 
diagnostic marker for identifying certain types of lung cancers. 

C. The Utility of the Claimed Nucleic Acids Does Not Depend Upon the 
Properties of the Encoded Polypeptide 

On page 5 of the final Office Action mailed October 1, 2005, the Examiner points out 
that "there is no evidence regarding whether or not PROl 153 mRNA or polypeptide levels are 
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also increased in this cancer". The Examiner points out, especially on page 5 of the Final Office 
Action, that: 

"what is often seen is a lack of correlation between DNA amplification and mRNA levels 
(Pennica et al) As discussed by Haynes et al, polypeptide levels cannot be accurately 
predicted from mRNA levels. . .the literature cautions researches against drawing 
conclusions based on small changes in transcript expression levels between normal and 
cancerous tissue (Hu et al.)" 

Appellants strongly disagree. First of all, Appellants submit that the instant claims are 
directed to nucleic acids , not polypeptides, therefore, the issue of whether there is a correlation 
between gene amplification and polypeptide expression levels is irrelevant . One of skill in the 
art would understand how to use the claimed nucleic acids to detect amplification of the gene 
encoding PROl 153, and how to use the gene amplification results to diagnose cancer. Thus the 
question of whether or not PROl 153 mRNA or polypeptide levels are also increased in these 
cancers has no relevance to the utility of the claimed nucleic acid molecules. 

The claimed nucleic acids can be used in cancer diagnosis without any knowledge 
regarding the function or cellular role of the encoded protein. Appellants submit that the law 
clearly states that "it is not a requirement of patentability that an inventor correctly set forth, or 
even know, how or why the invention works." Newman v. Quigg, 1 1 USPQ2d 1340 (Fed. Cir. 
1989). Accordingly, the disclosure or identification of the mechanism by which PROl 153 is 
associated with cancer is not required in order to establish the patentable utility of the claimed 
PROl 153 nucleic acids. 

D. A prima facie case of lack of utility has not been established 

The Examiner further cited Hu et al, to show that " the literature cautions researchers 
against drawing conclusions based on small changes in transcript expression levels between 
normal and cancerous tissues" (Page 5 of the Final Office action mailed October 1, 2004). 

First of all, as discussed above, the increase in DNA copy number for the PROl 153 gene 
is significant and would not be considered "small" according to the Goddard declaration. 
Further, contrary to the Examiner's assertion, the cited Hu et al reference does not conclusively 
establish a prima facie case for lack of utility for the PROl 153 molecule. The Hu et al 
reference is entitled "Analysis of Genomic and Proteomic Data using Advanced Literature 
Mining" (emphasis added). Therefore, as the title itself suggests, the conclusions in this 
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reference are based upon statistical analysis of information obtained from published literature, 
and not from experimental data. Hu et al performed statistical analysis to provide evidence for a 
relationship between mRNA expression and biological function of a given molecule (as in 
disease). The conclusions of Hu et al. however, only apply to a specific type of breast tumor 
(estrogen receptor (ER)-positive breast tumor) and cannot be generalized to breast cancer genes 
in general, let alone to cancer genes in general. Interestingly, the observed correlation was only 
found among ER-positive (breast) tumors not ER-negative tumors." (See page 412, left column). 

Moreover, the analytical methods utilized by Hu et al. have certain statistical drawbacks, 
as the authors themselves admit. For instance, according to Hu et al, "different statistical 
methods" were applied to "estimate the strength of gene-disease relationships and evaluated the 
results." (See page 406, left column, emphasis added). Using these different statistical methods, 
Hu et al "[assessed the relative strengths of gene-disease relationships based on the frequency 
of both co-citation and single citation." (See page 411, left column). As is well known in the 
art, different statistical methods allow different variables to be manipulated to affect the resulting 
outcome. In this regard, the authors disclose that, "Initial attempts to search the literature " using 
the list of genes, gene names, gene symbols, and frequently used synonyms generated by the 
authors "revealed several sources of false positives and false negatives." (See page 406, right 
column). The authors add that the false positives caused by "duplicative and unrelated meanings 
for the term" were "difficult to manage." Therefore, in order to minimize such false positives, 
Hu et al disclose that these terms "had to be eliminated entirely, thereby reducing the false 
positive rate but unavoidably under-representing some genes." Id. (emphasis added). Hence, Hu 
et al had to manipulate certain aspects of the input data, in order to generate, in their opinion, 
meaningful results. Further, because the frequency of citation for a given molecule and its 
relationship to disease only reflects the current research interest of a molecule, and not the true 
biological function of the molecule, as the authors themselves acknowledge, the "[Relationship 
established by frequency of co-citation do not necessarily represent a true biological link." (See 
page 41 1, right column). Therefore, based on these findings, the authors add, "[t]his may reflect 
a bias in the literature to study the more prevalent type of tumor in the population. Furthermore, 
this emphasizes that caution must be taken when interpreting experiments that may contain 
subpopulations that behave very differently." Id. (Emphasis added). In other words, some 
molecules may have been underrepresented merely because they were less frequently cited or 
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studied in literature compared to other more well-cited or studied genes. Therefore, Hu et al.'s 
conclusions are not based on genes/mRNA in general. 

Therefore, Appellants submit that, based on the nature of the statistical analysis 
performed herein, and in particular, based on Hu's analysis of only one class of genes, namely, 
the estrogen receptor (ER)-positive breast tumor genes, the conclusions drawn by the Examiner, 
namely that, "genes displaying a 5-fold change or less (mRNA expression) in tumors compared 
to normal showed no evidence of a correlation between altered gene expression and a known role 
in the disease (in general)" is not reliably supported. 

In conclusion, when the proper legal standard is used, a prima facie case of lack of utility 
has not been met based on the cited references Pennica et al., Haynes et al. or Hu et al by the 
Examiner. 

On the contrary, Appellants submit that Example 170 in the specification discloses 
evidence to show that the PRO 1 153 gene is amplified in certain lung cancers, and is therefore 
useful in the diagnosis of these cancers. 

Thus, based on the asserted utility for the PROl 153 gene in the diagnosis of selected lung 
tumors, the reduction to practice of the instantly claimed nucleic acid sequence of SEQ ED NO: 
350 in the present application (also see pages 220-222), the disclosure of the step-by-step 
protocols for making and isolating cDNA clones encoding PROl 153 of SEQ ID NO:350 at least 
in Example 108, page 490 of the specification, the disclosure of a step-by-step protocol for 
expressing PROl 153 cDNA in appropriate host cells (in Examples 140-143 and page 376, line 
12), the step-by-step protocol of the gene amplification assay in Example 170, the skilled artisan 
would know exactly how to make and use the claimed nucleic acids for the diagnosis of lung 
cancers. Appellants submit that based on the detailed information presented in the specification 
and the advanced state of the art in oncology, the skilled artisan would not have found any 
experimentation associated with testing lung or colon tumors, given the present disclosure, 
'undue'. 

Therefore, since the instantly claimed invention is supported by either a credible, specific 
and substantial asserted utility or a well-established utility, and since the present specification 
clearly teaches one skilled in the art "how to make and use" the claimed invention without undue 
experimentation, Appellants respectfully request reconsideration and reversal of this outstanding 
rejections under 35 U.S.C. §101 and §1 12, First Paragraph to Claims 124, 129-131 and 135-145. 
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CONCLUSION 

For the reasons given above, Appellants submit that present specification clearly 
describes, details and provides a patentable utility for the claimed invention. Moreover, it is 
respectfully submitted that based upon this disclosed patentable utility, the present specification 
clearly teaches "how to use" the presently claimed polypeptide. As such, Appellants respectfully 
request reconsideration and reversal of the outstanding rejection of claims 124, 129-131 and 135- 
145. 



Respectfully submitted, 



Date: November 14, 2005 

Ginger R. Dreger, Esq. 
Reg. No. 33,055 



HELLER EHRMAN LLP 

275 Middlefield Road 
Menlo Park, California 94025-3506 
Telephone: (650) 324-7000 
Facsimile: (650) 324-0638 
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IX. CLAIMS APPENDIX 



Claims on Appeal 

124. An isolated nucleic acid comprising: 

(a) the nucleic acid sequence of SEQ ID NO: 350; 

(b) the full-length coding sequence of the nucleic acid sequence of SEQ ID NO: 350; 

or 

(c) the full-length coding sequence of the cDNA deposited under ATCC accession 
number 209982. 

129. The isolated nucleic acid of Claim 124 comprising the nucleic acid sequence of 
SEQ ID NO: 350. 

130. The isolated nucleic acid of Claim 124 comprising the full-length coding 
sequence of the nucleic acid sequence of SEQ ID NO: 350. 

131. The isolated nucleic acid of Claim 1 24 comprising the full-length coding 
sequence of the cDNA deposited under ATCC accession number 209982. 

135. A vector comprising the nucleic acid of Claim 124. 

136. The vector of Claim 135, wherein said nucleic acid is operably linked to control 
sequences recognized by a host cell transformed with the vector. 

137. A host cell comprising the vector of Claim 135. 

138. The host cell of Claim 137, wherein said cell is a CHO cell, an E. coli or a yeast 

cell. 

139. An isolated nucleic acid molecule consisting of a fragment of the nucleic acid 
sequence of SEQ ID NO: 350, or a complement thereof, of at least 20 nucleotides in length that 
hybridizes under stringent conditions to: 

(a) the nucleic acid sequence of SEQ ID NO:350 or a complement thereof; 

(b) the full-length coding sequence of the cDNA deposited under ATCC accession 
number 209982 or a complement thereof; 

17 



wherein, said stringent conditions use 50% formamide, 5X SSC, 50 mM sodium 
phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5X Denhardt's solution, sonicated salmon 
sperm DNA (50 jig/ml), 0.1% SDS, and 10% dextran sulfate at 42°C, and washes at 42°C in 
0.2X SSC, at 55°C in 50% formamide followed by a high-stringency wash at 55°C in 0. IX SSC, 
EDTA; wherein said isolated nucleic acid molecule is suitable for use as a PCR primer or probe. 

140. The isolated nucleic acid molecule of Claim 139 that is at least 50 nucleotides or 
above in length. 

141 . The isolated nucleic acid molecule of Claim 139 that is at least 60 nucleotides or 
above in length. 

142. The isolated nucleic acid molecule of Claim 139 that is at least 70 nucleotides or 
above in length. 

143. The isolated nucleic acid molecule of Claim 139 that is at least 80 nucleotides or 
above in length. 

144. The isolated nucleic acid molecule of Claim 139 that is at least 90 nucleotides or 
above in length. 

145. The isolated nucleic acid molecule of Claim 139 that is at least 100 nucleotides or 
above in length. 
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Item 7 was submitted with the Appellants' Response filed on August 24, 2005 and whose entry 
was indicated as acceptable in a telephone conference with Examiner Wegert on August 24, 
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Genetic Instability in Epithelial 
Tissues at Risk for Cancer 



WALTER N.HITTBLMAN 

Department of Experimental Therapeutics, The University of Texas 
M. D. Anderson Canter Center. Houston, Texas 77030, USA 

Abstract: Epithelial tumors develop through a mulUstep process driven by 
genomic instability frequently associated with etiologic agents such as pro- 
longed tobacco smoke exposure or human papilloma virus (HPV) infection. 
The purpose of the studies reported here was to examine the nature of genomic 
instability in epithelial tissues at cancer risk in order to Identify tissue genetic 
biomarkers that might be used to assess an individual's cancer risk and 
response to chemopreventlve Intervention, As part of several chemoprevention 
trials, biopsies were obtained from risk tissues (U., bronchial biopsies from 
chronic smokers, oral or laryngeal biopsies from individuals with premattg- 
nancy) and examined for chromosome Instability using in situ hybridization. 
Nearly all biopsy specimens show evidence for chromosome instability 
t&rougnout me exposed tissue, lncreaseu cnromosome instability was onserved 
With histologic progression In the normal to tumor transition of bead and neck 
squamous cell carcinomas. Chromosome instability was also seen in premallg* 
nant head and neck lesions, and high levels were associated with subsequent 
tumor development In bronchial biopsies of current smokers, the level of 
ongoing chromosome Instability correlated with .smoking intensity (eg., 
packs/day), whereas the chromosome index (average number of cnromosome 
copies per cell) correlated with cumulative tobacco exposure (i.e., pack-years). 
Spatial chromosome analyses of the epithelium demonstrated multifocal clonal 
outgrowths. In former smokers, random chromosome instability was reduced; 
however, clonal populations appeared to persist for many years, perhaps 
accounting for continued long cancer risk following smoking cessation. 

Keywords: chromosome instability; epithelial cells; aerodlgestive tract; 
chemoprevention; cancer risk 



THE NEED FOR BIOMARKERS OP CANCER RISK AND 
RESPONSE TO INTERVENTION 

Epithelial cancers remain a major health challenge in the world. Despite improve- 

meats in staging-and-the application and integration-of-surge^ f -radom«rar^ r and 

chemotherapy, the 5-year survival rate for individuals with lung cancer is only about 
15%.' Even if strategies for early detection are successful and long cancers 
are detected at a stage where local tumor resection and treatment is curative, 
these patients will still be at significant risk for developing second primary tumors 
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associated with the problem of field cancerization. 2 Similarly, for individuals' with a 
first head and neck primary tumor, even if the first malignancy is successfully treat- 
ed, the risk of developing a second primary in the tobacco smoke-exposed field is 
approximately 40%. 3 Similar cancer risk estimates exist for individuals who exhibit 
severe dysplasia in premalignant epithelial lesions. 4 For these reasons, it is important 
to focus on chemopreventive strategies to_ prevent the development of epithelial * 
malignancies. 

Several- problems confront chemoprevention trials designed to identify effica- 
cious agents. 5 First, chemoprevention trials with cancer incidence as a primary end* 
point require tens of thousands of subjects and tens of years of intervention and 
foltow-up for statistical evaluation. For example, a recently reported trial involved 
30,000 subjects and required 10 years in order to examine the impact of prevention 
strategies on lung cancer development, only to find a possible increased lung cancer 
incidence in current smokers who received ^-carotene. 6 

The problem of large, long-term trials results from the difficulty in identifying 
individuals at highest cancer risk who might best benefit from chemopreventive 
intervention. For example, 20 pack-year smokers, while known to be at relatively 
increased risk for developing lung cancer, have approximately a 10% lifetime risk 
for developing lung cancer/ This seriously limits the number of potentially useful 
strategies that ran hp rlinicaUyxxploreri A second proMem faring chemoprevention. 
trials is that little is known about what agents are likely to have efficacy, and even 
less is known regarding proper doses, schedules, and durations of treatment Part of 
the reason for this problem is that too little is known about the physiologic processes 
that drive epithelial cancer development 

In order to reduce the number of subjects and the time required to carry out 
chemoprevention trials and thus allow the exploration of multiple prevention strate- 
gies, two types of advances are necessary. First, it is important to identify individuals 
at significantly increased cancer risk who might best benefit from different types of 
intervention. Second, in order to allow the rapid identification of agents, doses, and 
schedules of potentially efficacious agents, it is necessary to identify and validate 
surrogate endpoints of response that indicate whether the agents are having a posi- 
tive impact on the target tissue during the chemopreventive intervention. 

One approach to identifying individuals at increased aerodigesrive tract cancer 
risk is to explore epidemiologic features of potential subjects. Molecular epidemio- 
logic studies are beginning to identify intrinsic host factors that place some individ- 
uals at increased cancer risk, especially those with a chronic smoking history. 8 Most 
intrinsic factors identified thus far reflect levels of carcinogen metabolism, repair 
capabilities of the host following DNA damage, and other measures of intrinsic 
"cellular sensitiviryto mutagens. While these factors can provide stati^cafl^ighlf" 
. icant risk ratios in case-control studies that are controlled for tobacco exposure, the 
detected risk ratios usually fall in the range of 1.5 to 10. Unfortunately, this is not 
sufficient for the individualization of treatment and is not sufficiently high to signif- 
icantly reduce the numbers of subjects required for chemoprevention trials with 
cancer incidence as the primary endpoirit 

Another approach to identifying individuals at increased cancer risk is to directly 
examine the target tissue of individuals with known carcinogen exposure (e.g., 
chrome tobacco smoke exposure), who have evidence of target organ dysfunction 
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(e.g., chronic obstructive pulmonary disease, changes in voice quality), of who 
have clinical evidence of premalignancy (e.g., bronchial metaplasia/dysplasia, oral 
leukoplalda/erythroplakia, cervical intraepithelial neoplasia). The conventional 
standard for assessing cancer risk in these situations is the degree of histological 
change. However, while individuals who show moderate to severe dysplasia are 
known to be at increased cancer risk when compared to individuals with lesser his- 
tologic changes, it is often difficult to distinguish reactive changes to carcinogenic 
insult from initiated and progressing lesions. Similarly, upon cessation of carcino- 
genic insult, histologic changes may reverse yet cancer risk may continue for many 
years. For example, while smoking cessation is associated with decreased bronchial 
metaplasia, 9 increased lung cancer risk continues for many years beyond smoking 
cessation. 10 In fact, nearly half the newly diagnosed lung cancer cases in the USA 
occur in former smokers." 

The development of assays to identify individuals at high epithelial cancer risk 
and to directly assess response to intervention in the target tissue is therefore an 
important research goal. Such assays should be objective and easily quantifiable and, 
if possible, minimally invasive. Moreover, they should reflect both the disease pro- 
cess and the targeted pathway and thereby be useful in assessing risk and monitoring 
response to intervention as well as directly testing the hypothesized mechanism of 
-acdon-of4he-chemopr ev e ntiv e strategy. — 

In the chemoprevention setting it is important to recognize that one does not 
know the location of the future cancer. Thus, assays must necessarily be carried out 
on random biopsies of the field at risk. Even if there are clinically evident premalig- 
nant lesions, this does not mean that this is the likely site for a future malignancy. 
For example, nearly half of the cancers that develop in individuals with oral leuko- 
plakia arise away from the original index lesion. Similarly, since many newly diag- 
nosed lung cancers arise in the peripheral parts of the lung (e.g., adenocarcinomas), 
especially in former smokers, and since endobronchoscopy predominantly accesses 
central components of the lung, it is important to identity biomarkers that can reflect 
global processes ongoing in the target epithelial field associated with increased can- 
cer risk. Their discovery requires a better understanding of the tumorigenesis pro- 
cess in epithelial fields at cancer risk. 



THE RATIONALE FOR STUDYING 
GENOMIC INSTABILITY AS A MARKER OF RISK 

Tumors of meaero^igestiwtfact have been proposedTbleflecta~ M field canceriza^ 
tion" process whereby the whole tissue is exposed to carcinogenic insult (e.g., tob- 
acco smoke) arid is at increased risk for multistep tumor development 12 - 13 Several 
types of clinical and laboratory data support this notion, including the frequent 
occurrence of synchronous primary and subsequent second primary tumors in the 
aerodigestive tract (frequently exhibiting dissimilar histologies as well as distinct 
genetic signatures 14 " 16 ) and the presence of premalignant lesions that precede and/or 
accompany the tumor in the exposed tissue field. 17 The notion of a multistep tumor- 
igenesis process is further supported by serial clinical and histologic evaluations of 
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target tissue or exfoliated cells where increasing degrees of histological abnormali- 
ties are observed over time. 18 

A working model for aerodigestive tract tumorigenesis is illustrated in Figure 1. 
Tumorigenesis in the face of carcinogenic exposure likely involves a chronic process 
of tissue injury and wound healing. DNA damage induced by the carcinogen is likely 
fixed into permanent genetic changes (e.g., chromosome damage, chromosome non- 
disjunction, gene mutation, gene deletion, etc.) during the process of proliferation. 
This damage would be expected to be distributed throughout the exposed (issue field 
leading to a background of generalized genomic damage (depicted in Figure 1 as a 
background mat of increasing density). Chronic injury and repair likely leads to the 
accumulation of cells with increasing amounts of genetic changes as well as the out- 
growth of abnormal clones (triangles in Rgurb 1) carrying an accumulation of 
genetic changes important for selective survival* dysregulated growth, and preferen- 
tial epithelial take-over by initiated clones (see Figure 2). 

Cellular and molecular evidence for the field carcinogenesis and multistep tum- 
origenesis model comes from many laboratories. 19 * 20 With the advent of a wide array 
of molecular technologies, a large number of specific molecular genetic and epige- 
netic changes involving specific oncogenes, tumor suppressor genes, cell regulatory 
genes, and repair genes have now been described for aerodigestive tract cancers. The 
i dent ifi catio n of these specific molecular changes have now provided probes to 
explore specific events occurring in premalignant lesions adjacent to aerodigestive 
tract tumors. 21 " 24 Frequently, these premalignant lesions showed a subset of the 
same molecular changes found in the associated tumor, suggesting that these lesions 
might represent precursor lesions for the associated tumors (i.e^, a manifestation of 
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FIGURE 1. Field cancerization and multistep tumorigenesis. 
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FIGURE 2. Multiple focal clonal evolution during multistep tumorigenesis. 



a multistep tumorigenesis process). For example, studies of the premalignant lesions 
adjacent to head and neck tumors have provided evidence for a gradual accumulation 
of genetic alterations accompanied by evidence for dysregulation of cellular control 
mechanisms (e.g., alterations in expression of PCNA, EGFR, TGF-fl, p53, and 
cyclinDl). 25 - 28 

These types of studies have now also been applied to the target epithelium of indi- 
viduals at increased risk for aerodigestive tract cancer (i.e., individuals with a chron- 
ic smoking/alcohol history and/or prior aerodigestive tract cancer). Several groups 
(using polymerase chain reaction, PCR, analysis of microdissected epithelium) have 
now demonstrated the presence of clonal outgrowths in the target premalignant epi- 
thelium of individuals at increased risk for cancer. 29-31 For example, examination of 
bronchial biopsies derived from individuals with a 20 pack-year smoking history 
demonstrated that 76% of the cases showed evidence for LOH (3pl4, 9p21, or 
17pl3) in at least one of six lung biopsy sites. On a per site basis, some form of LOH 
was observed in 25% of the sites examined. 29 

If aerodigestive tract cancer development reflects a field cancerizalion process 
involving" multistep events7toen^slfaTfd response mToTmadonsh^Id be able~t^be~ 
derived from random biopsies or exfoliated cells from the field at risk or from assess- 
ments of tissue undergoing similar processes. Hypothetically, lesions exhibiting the 
greatest degree of genomic instability, clonal outgrowth, and abnormal epithelial 
regulation would be at the highest relative aerodigestive tract cancer risk. Similarly, 
an active chemopreventive intervention might be expected to decrease these mani- 
festations of risk. Reduced risk manifestations include decreased levels of ongoing 
genetic instability, decreased frequency of clonal outgrowths, and increased epithe- 
lial growth regulation. 
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THE MEASUREMENT OF CHROMOSOME INSTABILITY USING 
CHROMOSOME IN SITU HYBRIDIZATION 

Modular genetic techniques, while extremely useful for detecting clonal chang- 
es in targets tissues, are somewhat limited in their ability to detect random genetic 
instability. Conventional cytogenetic assays are useful for detecting chromosome 
instability and clonal chromosome changes. However, they require numbers of 
dividing cells for karyotypic analysis that, are difficult to attain in the setting of biop- 
sies acquired during the course of a chemoprevention trial. A technique was there-: 
fore needed that would allow chromosome instability measurements in situations 
where few cells are available (e.g. small biopsies, brushings, or sputum samples) and 
where the target material might be fixed. It was also desirable to have a technique 
that would be adaptable to tissue sections, whereby spatial information could be 
retained and genotype/phenotype associations could be determined on the same or 
adjacent tissue sections. The technique of in situ hybridization (ISH) involves the 
use of DNA probes that recognize either chromosome-specific repetitive target 
sequences, chromosome single gene copy sequences, or sequences along the whole 
chromosome length or chromosome segments. 32 We have adapted the ISH technique 
for formalin-fixed, paraffin-embedded tissue sections and have applied it to a variety 

of tissues, including the aerodigestive tract 33 * 34 

Using probes that label the centromere regions of specific chromosomes, this 
assay permits determination of the average chromosome number per cell for each 
specimen. This assay is also useful for detecting generalized chromosome instability 
during the tumorigenesis process. Normal diploid populations should have two cop- 
ies of each autosomal chromosome and should rarely show three or more chromo- 
some copies per cell (chromosome polysomy), especially in tissue sections where 
nuclear truncation results in an imder-representation of chromosome copy number. 
Thus, the detection of cells with three or more chromosome copies would indicate 
the presence of chromosome instability. 

To examine this technique's potential for characterizing the multistep tumorigen- 
esis process in the aerodigestive tract, we measured the fraction of cells exhibiting 
three or more chromosome copies in apparently contiguous epithelial transitions 
from normal to hyperplastic to dysplastic to carcinomas, all on a single tissue slice 
of head and neck squamous cell carcinomas. 34 In these specimens, greater than 35% 
of the cases of adjacent "normal" epithelium, greater than 65% of the cases of hyper- 
plastic epithelium, and greater than 95% of the dysplastic and tumor regions showed 
evidence of chromosome polysomy. Of interest, similar transitions of chromosome 
instability were observed with at least four different chromosome probes. Similar 
trends have also been observed in amenabl e tissue from other epithelial mali g nan- 
cies, including cervix, bladder, and breast, 35 These results thus suggested that the 
notions of field cancerization and multistep tumorigenesis might apply to several 
epithelial tissues and that measures of chromosome instability might be useful for 
monitoring this process. 

In the situations described above, the premalignant lesions examined might be 
considered to represent epithelium at 100% risk of being in a cancer field, since they 
were located in the adjacent epithelium to the cancer. This then raises the question 
of the nature of genetic instability in the epithelium of individuals at increased risk 
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for developing cancer. To explore this issue, we obtained biopsies daring the course 
of leukoplakia chemoprevention trials exploring the use of 13-c/>retinoic acid in 
reversing leukoplakia and probed them for genetic instability using in situ hybridiza- 
tion. In one retrospective study and in one prospective study of subjects with oral* 
leukoplakia, the results indicate that those subjects whose pretreatment biopsies har- 
bor relatively high levels of genomic instability (i.e., more than 3% of the cells 
examined showing at least 3 chromosome 9 copies per cell) have a significantly 
higher likelihood of suffering early onset of head and neck cancer. 36,37 Interestingly, 
half of the tumors that did develop occurred away from the biopsy site used to mea- 
sure genetic instability. This result suggests that genomic instability measurements 
in carcinogen-exposed tissue can provide useful cancer risk estimates. 



THE RELATIONSHIP BETWEEN TOBACCO EXPOSURE AND 
CHROMOSOME INSTABILITY 

In recent years, the aerodigestive tract chemoprevention group at M.D. Anderson 
Cancer Center has initiated three sequential biomarker-associated chemoprevention 
trials involving chronic smokers with a greater than 20 pack-year smoking history. 
-In-eaeh-o^ese^tadieSrendGbiQnchiaUMOpsie 
within the lung, including the carina and at bifurcation points at the upper, middle, 
and lower right lung and at the upper and lower left lung. Biopsies were obtained pri- 
or to and following chemopreventive intervention and were subjected to in situ 
hybridization analysis in addition to analyses for other biomarkers. The first impor- 
tant finding was that some degree of chromosome polysomy was evident in all lung 
sites examined, and this was observed independently of the particular chromosome 
probe utilized. 38 This finding supports the notion that random chromosome changes 
may be occurring throughout the exposed lung field. 

In a second study, bronchial biopsies were obtained from individuals with a 20 
pack-year smoking history. In this study, most of the subjects involved were current 
smokers. 39 Interestingly, all cases who showed metaplasia at one of six biopsy sites 
also showed chromosome polysomy in at least one biopsy site; overall, 88% of the 
sites showed some evidence of chromosome 9 polysomy. 40 Evidence for genetic 
instability was also detected in patients who did not show evidence of bronchial 
metaplasia in any of six biopsy sites despite a strong smoking history. In fact, more 
than 90% of the cases and more than 60% of the sites showed significant chromo- 
some polysomy (i.e., at least three copies in at least 2 % of the cells examined). 
These results suggest that the lungs of long-term smokers show significant evidence 
of g enetic instab ility, an d this instability can be detected throughout the accessible, 
bronchial tree, even when bronchial metaplasia is not evident. . 

These studies in current smokers has allowed us to examine the relationship 
between the levels of genetic instability detected and subject characteristics such as 
smoking status (current or former), smoking history, and lung tissue pathologic 
changes. Evaluable biopsy material has now been obtained from more than 108 cur- 
rent smokers, including more than 480 evaluable biopsy sites. The mean metaplasia 
index in these current smokers was 30.4%. For the total population studied, the 
median chromosome index for the bronchial biopsies was 1.41 (range, 1.04-1.61) 
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and the median chromosome polysomy index was 2.0% (range 0-8.7%). This can be 
compared to a mean chromosome index between 1.2-1.4 for lymphocytes and very 
rare chromosome polysomy. Interestingly, the intrasubject variability in chromo- 
some instability was relatively low in most subjects and was less than the intersub- 
ject variability, These results suggested that chronic smokers harbor detectable 
chromosome instability throughout the accessible bronchial tree (supporting the 
field carcinogenesis notion) and that information from one biopsy site might yield 
representative information for the rest of the lung field 

Since most of the current smokers exhibited bronchial metaplasia in at least one 
of the biopsied sites, this allowed us to examine the relationship between chromo- 
some instability and histologic changes, both on a site-by-site basis and on a per case 
basis. On a site-by-site basis, the chromosome indices of lesions showing squamous 
metaplasia were similar to those not showing metaplasia (i.e., median 1.43 vs. 1.43), 
arid the degree of chromosome polysomy in metaplastic lesions were only slightly 
higher than in non-metaplastic sites (medians: 2.2% vs. 1.8%, respectively). Thus, 
the presence or absence of squamous metaplasia at a biopsy site does not necessarily 
correlate with the degree of underlying genomic instability. On the other hand, those 
subjects with metaplasia indices of at least 15% also showed higher levels of chro- 
mosome polysomy than did subjects with metaplasia index below 15% (medians: 
— 2 . 4% VS, 1.8%. P - 0.005). Thus, these chromosome instability assessments in cur- 
rent smokers appeared to reflect a more global process in the lung field. 

Tobacco exposure has been shown to significantly increase the risk of developing 
lung cancer, and the degree of risk is related to the extent of tobacco exposure. We 
were interested in determining the relationship between individuals* smoking histo- 
ry parameters and the levels of chromosome change found in their lungs following 
years of tobacco exposure. While there was significant mtersubject variation for sim- 
ilar tobacco exposure histories, overall there was a significant correlation between 
the degree of chromosome polysomy and the intensity of ongoing tobacco exposure 
(packs/day, p = 0.02 on a per site basis) and with the extent of tobacco exposure 
(pack-years, p = 0.003). Thus the amount of chromosome polysomy reflects the 
intensity and extent of tobacco exposure. At the same time, individuals with similar 
smoking histories showed widely divergent amounts of chromosome polysomy, pos- 
sibly reflecting differences in intrinsic sensitivity between subjects. There was. also 
strong correlation between the chromosome index and the duration of the smoking 
history (smoking years) and total accumulated exposure (pack-years, p = 0.0001). 
These results suggest that tobacco exposure is associated with the initiation and 
accumulation of chromosome instability in the exposed lung; however individuals 
are differentially sensitive to carcinogenic insult The working hypothesis is that 
those individuals who accumulate the hi g hest d egree of chromosome changes will 
be at the highest lung cancer risk. 

Many of the bronchial biopsies from chronic smokers examined by in situ hybrid- 
ization showed a rise in the chromosome index above that expected for a diploid cell 
population, especially in subjects with an extensive smoking history. The rise in 
chromosome index was also accompanied by an increase in the fraction of cells 
exhibiting at least 3 chromosome copies per cell. To determine if a rise in the tissue 
chromosome index was due to clonal expansion of populations with chromosome tri- 
somy, the chromosome copy number and relative coordinates of each cell scored in 



HUTELMAN: GENETIC INSTABILITY IN TISSUES AT RISK FOR CANCER 9 

the bronchial epithelium was recorded and a spatial genetic map was created. 41 We 
then developed algorithms for calculating localized chromosome indices within the 
tissue. Since trisomic clones would have, on average, three chromosomes instead of 
. two, those cells involved in neighborhoods with chromosome indices three-halves 
that of diploid populations could be marked as being part of a trisomic clone. Simi- 
. larly, groups of cells with chromosome indices half that of diploid populations could 
be marked as being part of a monosomic clone. This allowed the generation of a sec- ' 
ond-order, two-dimensional genetic map representation of the bronchial epithelium 
showing the relative locations of cells involved in monosomic and trisomic clonal 
outgrowths. When adjacent tissue sections from the same bronchial biopsy were 
probed separately for different chromosomes, the detected clones appeared to occu- 
py separate subregions of the epithelium. This result suggests that not only are the 
lungs of chronic smokers undergoing a process of genetic instability, they are expe- 
riencing the outgrowth of multiple clones throughout the exposed lung field, as pos- 
tulated by the models shown in Figures 1 and 2. One advantage of this clonal 
approach is that the contribution of both monosomic and multisomic clones can be 
detected. 

Since smoking cessation has been suggested to reduce the lung cancer risk, it was 
of interest to determine whether the levels of chromosome instability would decrease 
.fo llowing smoking cessation. This question was possible to examine because our 
third sequential chemoprevention trial involved subjects who had discontinued 
smoking. So far, more than 220 subjects (more than 650 biopsies) who have quit 
smoking (mean 9.9 quit-years) have been evaluated for chromosome instability in 
their lungs. Despite the fact that the mean metaplasia index in this group is 5.8% 
(considerably less than that in current smokers), chromosome instability is still 
observed in the majority of subjects. 42 While the mean chromosome polysomy level 
is reduced to 1.0%, some individuals continue to show polysomy levels above 5%. 
Interestingly, while the overall chromosome polysomy levels were reduced in these 
individuals who stopped smoking, the mean chromosome index remained at about 
1 .4 with some individuals exhibiting chromosome indices as high as 1 .8. Initial chro- 
mosome mapping studies suggest that while random chromosome instability seems 
to decrease following smoking cessation, the clonal outgrowths may remain for 
many years in the lung. Hie working hypothesis is that those individuals who show 
the greatest degree of remaining chromosome instability are at the highest lung can- 
cer risk despite smoking cessation. Long-term follow-up on these subjects will be 
necessary to test this hypothesis. 



- SUMMARY-AND CONCLUSIONS 

Aerodigestive tract tumorigenesis appears to be a multistep process taking place 
throughout the tissue fields of exposure. When viewed in the context of chromosome 
changes, carcinogen exposure appears to be associated with the random acquisition 
Of chromosome polysomy throughput the exposed field, the degree of which is relat- 
ed to the degree and extent of carcinogen exposure as well as to the instrinsic suscep- 
tibility of the exposed individual. Continued exposure leads to continued acquisition 
of new changes and, in association with chronic wound-healing processes, to the 
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accumulation of clonal outgrowths throughout the target tissue. Although the ulti- 
mate malignancy may occur in only one or few tissue sites, manifestations of the 
instability process that drives tumorigenesis is globally present in the tissue. Thus 
random biopsies may provide useful risk information for the exposed field as a 
whole. Even when carcinogen exposure is reduced or chemopreventive strategies are 
initiated and histologic manifestations of the tumorigenesis process subside, the 
genetic scars of prior exposure remain in the form of clonal outgrowths and may 
explain continued lung cancer risk in ex-smokers. Future chemoprevention strategies 
need to focus on reducing the degree of chromosome instability and on trying to 
eliminate residual abnormal clonal outgrowths in the aerodigestive tract In this set- 
ting, the measurement of chromosome instability in the target tissue will be useful in 
assessing cancer risk as well as response to intervention. 
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Abstract 

Early identification and subsequent intervention are 
needed to decrease the high mortality rate associated with 
iung cancer. The examination of bronchial epithelium for 
genetic changes could be a valuable approach to identify 
individuals at greatest risk. The purpose of this 
investigation was to assay cells recovered from 
nonmalignant bronchial epithelium by fluorescence in situ 
hybridization for trisomy of chromosome 7, an alteration 
common in non-small cell lung cancer. Bronchial 
epithelium was collected during bronchoscopy from 16 
cigarette smokers undergoing clinical evaluation for 
possible lung cancer and from seven individuals with a 
prior history of underground uranium mining. Normal 
bronchial epithelium was obtained from individuals 
without a prior history of smoking (never smokers). 
Bronchial cells were collected from a segmental bronchus 
in up to four different lung lobes for cytology and tissue 
culture. Twelve of 16 smokers were diagnosed with lung 
cancer. Cytologies! changes found in bronchial epithelium 
included squamous metaplasia, hyperplasia, and atypical 
glandular cells. These changes-were present in-33,-12, and - 
47% of sites from lung cancer patients, smokers, and 
former uranium miners, respectively. Less than 10% of 
cells recovered from the diagnotic brush had cytological 
changes, and in several cases, these changes were present 
within different lobes from the same patient Background 
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frequencies for trisomy 7 were 1.4 ± 03% in bronchial 
epithelial cells from never smokers. Eighteen of 42 
bronchial sites from lung cancer patients showed 
significantly elevated frequencies of trisomy 7 compared 
to never smoker controls. Six of the sites positive for 
trisomy 7 also contained cytological abnormalities. 
Trisomy 7 was found in six of seven patients diagnosed 
with squamous cell carcinoma, one of one patient with 
adenosquamous cell carcinoma, but in only one of four 
patients with adenocarcinoma. A significant increase in 
trisomy 7 frequency was detected in cytologicaBy normal 
bronchial epithelium collected from four sites in one 
cancer-free smoker, whereas epithelium from the other 
smokers did not contain this chromosome abnormality. 
Finally, trisomy 7 was observed in almost half of the 
former uranium miners; three of seven sites positive for 
trisomy 7 also exhibited hyperplasia. Two of the former 
uranium miners who were positive for trisomy 7 
developed squamous cell carcinoma 2 years after 
collection of bronchial ceils* To determine whether the 
increased frequency of trisomy 7 reflects generalized 
aneuploidy or specific chromosomal duplication, a 
subgroup of samples was evaluated for trisomy of 
chromosome 2; the frequency was not elevated in any 
of the cases as compared with controls. The studies 
described in this report are the first to detect and 
quantify the presence of trisomy 7 in subjects at risk for 
lung cancer* These results also demonstrate the ability to 
detect genetic changes in cytologically normal cells, 
suggesting that molecular analyses may enhance the 
power for detecting premalignant changes in bronchial 
epithelium in high-risk individuals. 

Introduction 

Although lung cancer is the leading cause of cancer death in the 
.United-States-(l)* early_detection_ and intervention, could _de>_ 
crease ihe high mortality rate associated with this disease if 
sensitive screening approaches could be developed (2-4). Early 
detection may be feasible because the entire respiratory tract is 
exposed to inhaled carcinogens; therefore, the whole lung is at 
risk for developing multiple, independently initiated sites. This 
"field cancerization" condition (5) is supported clinically by a 
high frequency of second primary tumors in lung cancer pa- 
tients (6-9) and by the occurrence of progressive histological 
premaJignant changes throughout the lower respiratory tract of 
cigarette smokers (i0 f 1J); Moreover, recent studies using 
pathological tissues obtained after lung resection or autopsy 
have identified genetic aberrations associated with lung cancer 
in nonmalignant bronchial epithelium adjacent to tumors (12- 
16). 

Although examination of pathological samples is useful 
for identifying genetic changes associated with carcinogenesis, 
this invasive approach for collection of clinical samples nec- 
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essary for early detection would not be appropriate for screen- 
ing. However, bronchial epithelial cells harvested using routine 
clinical procedures could be examined for genetic changes as an 
initial approach for detecting individuals at high risk for lung 
cancer. This approach could also provide genetic markers for 
evaluating the effectiveness of chemoprevention regimens. 
Bronchoscopy provides direct access to viable cells within the 
airways and is a commonly used tool for obtaining samples 
from the lower respiratory tract, including bronchial epithelium 
(17). This procedure can be used to repeatedly sample the 
bronchial epithelium over time and to collect viable cells that 
can be expanded through tissue culture for functional assays. 

Because of field caacerization, genetic abnormalities 
should be dispersed throughout the bronchial epithelium of 
persons at risk for lung cancer. The purpose of this investiga- 
tion was to test this hypothesis by sampling nonmalignant 
bronchial epithelium from distinct locations within four differ- 
ent lobes of the lung from persons at risk for lung cancer and 
then assaying the bronchial cells for the presence of specific 
genetic abnormalities. Trisomy of chromosome 7 was exam- 
ined in these cells, because this alteration is common in solid 
tumors, including lung cancer, of several different organ sys- 
tems (18, 19). In addition, trisomy 7 has been detected in 
premalignant lesions such as villous adenoma of the colon (20), 
in the colonic mucosa of individuals with familial polyposis 
(21 ), and in the far margins of some resected lung tumors (22). 
Our results demonstrate that trisomy 7 can be detected in 
nonmalignant bronchial epithelium from patients with lung 
' cancer distant to the site of the tumor and in individuals without 
tumors who are at high risk for lung cancer development. 
Together, these studies suggest that an extra copy of chromo- 
some 7 may be an intermediate biomarker of ongoing field 
carcinogenesis. 

Materials and Methods 

Subject Recruitment. Bronchial epithelium was collected 
from 16 cigarette smokers undergoing a diagnostic workup for 
possible lung cancer and from 7 individuals with a prior history 
of underground uranium mining, 5 of whom were also smokers. 
Three individuals who had never smoked were also recruited to 
obtain bronchial epithelium not exposed directly to either to- 
bacco smoke or radon progeny. 

Pathology and Exposure History. Twelve of the 16 cigarette 
smokers who underwent diagnostic bronchoscopy were diag- 
nosed with NSCLC. 3 Seven tumors were characterized histo- 
logically as SCCs, four tumors were ACs, and one tumor was 
an adenosquamou s cell c arcinoma. Lung cancer was not evi- 
dent in the other four subjects. Smokin^hisltor ies ranged "from 
15 to 120 pack-years (defined as the number of cigarettes 
smoked per day times the number of years smoked). All of the 
former uranium miners worked underground between 2 and 20 
years, with a range of 27-527 working level months. Five of the 
seven miners had smoking histories that ranged from 20-60 
pack-years. 

Bronchoscope Collection and Processing of Bronchial Ep- 
ithelium. A protocol was developed for harvesting viable 
bronchial epithelium from the lower respiratory tract using a 
standard cytology brush during bronchoscopy. After iniroduc- 



? The abbreviations uwd are: NSCLC, nou-sma:) cell lung cancer; SCC, squa- 
mous cell cancer. AC. adenocarcinoma; EGFR. epidermal growth fatter receptor: 
FISH, fluorescence in situ hybridization: LOH. loss of heterozygosity; BEGM, 
Bronchial Epithelium Growth Medium. 



tion into the lower respiratory tract, the bronchoscope was 
directed into each upper and lower lobe, and the carina! margin 
of a segmental orifice, usually the second and third birurca UOa 
within the upper and lower lobes, respectively, was brushed 
These sites were chosen because (a) they arc uigh-depo s fciorj 
areas for particles; (b) they are associated frequently with 
histological changes in smokers; and (c) they represent sites 
where tumors commonly occur (11, 23). The area was first 
washed with saline to remove any nonadherent cells. Sites were 
not brushed if a tumor was visualized within 5 cm of the site" 
After brushing, the brush was withdrawn, placed in serum-free 
medium, and kept on ice until processed. Each site was brushed 
twice. The procedure was well tolerated by all subjects, and no 
complications were noted related to the brushing procedure. 

Bronchial cells were collected from only two of the sites 
in two of the subjects, from three sites in two subjects, and from 
all four sites in the remaining subjects. Although only two sites 
were brushed initially in case 1, cells were obtained from all 
four sites in this subject during a repeat bronchoscopy per- 
formed after the initial procedure did not yield a diagnosis. 
Samples were obtained from all four sites in the cancer-free 
current smokers and in the never smokers. In addition, bron- 
chial epithelial cells derived at autopsy by Oonetics, Inc. (San 
Diego, CA) from four never smokers were also obtained to 
serve as additional controls. Only two sites sampled from most 
of the former uranium miners were available for analysis be-: 
cause cells recovered from the other sites had been used ex- 
clusively for cytology in another investigation. 4 
Bronchial Epithelial Cell Culture. Replicative cultures of ibev 
bronchial epithelial cells obtained by the procedure described-'" 
above were established in our laboratory (24) using a scruns^ 
free medium (BEGM; Clonetics, Inc.) that is optima; for growm : yo v; . 
of these cells. Cells were removed from brushes by vigorous^ 
shaking in BEGM; cells from one brush were prepared for^; 
cytological analyses, and cells from the other brush were;£; 
washed, resuspended in BEGM, scedeci onto 60-mm fibrdnec^ 
tin-coated plates, and grown at 37°C in 3% C0 2 and 21% Q|§f 
until 80% confluence. Prior to passage, aliquots of cells wert£§| 
cryopreserved and stored at — 145°C; other samples of cc#$f| 
were fixed in methanol-acetic acid (3:1). Next, the cells wer^||| 
washed [bur to six times in methanol :acetic acid and theife|| 
dropped onto slides (about 2 X 10 5 cells/slide). The effects 
cell culture on the frequency of trisomy 7 in nonmalignanl^| 
bronchial epithelium were examined by placing cells dispersed^ 
from brushes directly onto microscope slides followed by fef^" 
ation. 

Cytology. Cells from one brush from each bronchial coilcctr6if||! 
site-were prepared for cytological analysis by smearing the ceM^| 
across a microscope slide. The cells were then fixed with 96^£|| 
cthanol and stained according to the Papanicolaou procedural 
(25) to facilitate morphological evaluation by a cytopatboiogisto||| 
Detection of Trisomy 2 and Trisomy 7. Trisomy 2 and: tr^J 
somy 7 were determined by hybridization of cells with ^bioti^g 
nylatcd chromosome 2 or 7 centromere probe (Oncor; Gaith^^ 
crsburg, MD), The probes were denatured in hybridizfiticf|g| 
buffer at 70°C for 5 min, and the slides were immersed in 70?^"^ 
forinamide-2x SSPE at 70°C for 2 min. The probe was ! 
applied to the slides, which were incubated in a humidif 
chamber at 37°C for 16 h. After incubation, the slides *^|| 
washed in 0.25 x SSPE (10 mM sodium phosphate monobasg 
monohydrate; 1 mM ethylenediamine tetraacetic acid disodfo$| 



J Unpublished data. 



Ma+oriol maw ha nm+oo+oH hw rnwmnhi Is»\a/ /Ti+lo 1*7 II Q C*r\ria\ 



Epidemiology, Blomarkcrs & Prevention 633 



yt was 
margin 
tcation 
rushed, 
osition 
y with 
it sites 
as first 
:s were 
ae site, 
ra-free 
rushed 
and no 
dure, 
le sites 
d from 
r o sites 
om all 
»y per- 
gnosis. 
er-free 
. bron- 
(San 
ned to 
a most 
sis be- 
ed ex- 

; of the 
bribed 
ierum- 
irowth 
aarous 
ed tor 
were 
ronec- 
\%0 2 
5 were 
f cells 
s were 
i then 
xls of 
ignant 
persed 
3y fix- 

ection 
"ecells ~ 
h96% 
:edurc 
logist. 

ad tri- 

bioti- 
Gaith- 
zation 
1 70% 
s then 
idified 

were 
abasic 
odium 



salt, dihydrate; 150 dim sodium chloride, pH 7.4) for 5 min at 
72°C, and the probe was detected with fluorescein-labeled 
avidin. Cell nuclei were visualized with propidium iodide. 
Data Analysis. The number of centromeric hybridization sig- 
nals in each cell were evaluated in 400 cells/slide, and the 
frequency or trisomy 7 on each slide was calculated by dividing 
the total number of cells expressing three hybridization signals 
by the total number of cells counted on each slide. Twenty % 
of the slides were scored by a second person, and frequencies 
tor trisomy 7 differed by <0.4%. Hie iota) number of sites 
positive for trisomy 7 in subjects with SCC and AC were 
compared using Fisher's exact) test 

Results 

Cytology. Squamous metaplasia and atypical glandular cells, 
the only cytological abnormalities observed in lung cancer 
patients, were present in 32% of the samples (Table 1). These 
cytological changes were observed in <10% of the cells re- 
covered from the diagnostic brush. Two subjects had three sites 
with cytological abnormalities, and five subjects had no cyto- 
logical abnormalities. No samples contained tumor cells by 
cytology, although one of four sites in five subjects was col- 
lected from the same lobe where a tumor was later diagnosed. 

Two of the 16 sites in smokers without lung cancer were 
cytologicalry abnormal (both in the same person; Table 2), 
whereas no atypical cells were present in the 12 sites from the 
three never smokers (Table 3). In former uranium miners, 
hyperplasia was present in bronchial cells collected from all 
four sites from one person, and in one site in two additional 
people (Table 2). 

Culturing of Bronchial Epithelial Cells. The efficiency of 
establishing repiicative cultures of the cells obtained by bron- 
chial brushing was 100%. The serum-free medium used for 
these cultures is optimal for growing bronchial epithelial cells 
and does not support fibroblastic cell replication (25). There- 
fore, the cells were uniformly epitheloid in appearance. Growth 
potential was evaluated by passaging cells from all seven of the 
uranium miner cases and cases 1-6 from the lung cancer 
patients. Some of these cultures were maintained for up to nine 
passages (a minimum of 16 population doublings), and many 
underwent 30 divisions before senescence. However, none ex- 
hibited an indefinite population-doubling potential. 
Detection of Trisomy 7 in Nonmalignant Bronchial Epithe- 
lium. Background rates of trisomy 7 were determined by ex- 
amining normal human bronchial epithelial cell lines derived 
from autopsy cases of never smokers and bronchial epithelium 
c^ollec^d"froTn 

chial cell lines (passage 2) from four donors and bronchial 
epithelial cell samples obtained by bronchial brushing from the 
recruited never smokers (Table 3), only 1.4 ± 0.3% (SD) of the 
cells contained three hybridization signals for chromosome 7 
with values ranging from 1 to 1.8%. These values agree with 
those reported by the manufacturer of the probe. Therefore, 
trisomy 7 frequencies of >2.0% (>2 SD above the mean for 
controls) were considered sigrrificandy different from controls. 

Passage 1 or 2 bronchial cells from lung cancer patients 
were examined for trisomy 7. Eighteen of the 42 bronchial sites 
(43%) sampled from the 12 lung cancer patients contained 
trisomy 7 at frequencies ranging from 2.3 to 6,0% (Table 1 ; Fig. 
1). Three subjects (cases 1, 2, and 11) displayed trisomy 7 in all 
sites collected during bronchoscopy, and in two subjects (cases 
7 and 12), trisomy 7 was found in three of four sites (Table 1). 
Six of the 18 sites positive for trisomy 7 also contained cyto- 
logically abnormal cells. Trisomy 7 was found in six of seven 



Table J Frequency of trisomy 7 in bronchial epithelial cells from lung cancer 
patient* . 



Case 


Age 


Smoking 
(pack-yrs) 


Tumor 

diagnosis 


Brush 
location 


CytoJogicai 
diagnosis 


Trisomy 7 
(frequency, %) 


1 


64 


10* 


SCC 


RLL° 


N 


2.8* 










RUL 


AGC 


4.0* 










RU/ 


N 


3.0* 










RUIZ 


N 


4.0* 










LLL r 


N 


6.0* 










LUL' 


SM 


4.3* 


2 


69 


26 


SCC 


RUL 


SM 


2.8* 










LLL 


SM 


3.3* 










LUL 


N 


3.8* 


3 


65 


120 


SCC 


RLL 


AGC 


2.0 






. - 





RUL 


-AGC 


. _ 2.3* 










IXL 


AGC 


2.0 , 


4 


52 


90 


AC 


RLL 


SM 


1.5 










RUL 


N 


1.8 










LLL 


SM 


IJ 










LUL 


SM 


1.8 


5 


70 


50 


SCC 


RLL 




IJ5 










RUL 


N 


13 










LLL 


N 


1.5 










LUL 


SM 


1.3 


6 


61 


93 


AC 


RLL 


N 


1.5 










RUL 


N 


1.3 










LLL 


N 


2.0 










LUL 


' N ■ 


1.5 


7 


58 


40 


SCC 


RLL 


N 


1.8 










Rl/L 


N 


2.3* 










LLL 


N 


2.5* 










LUL 


N 


2.8* 


s 


59 


120 


AdSCC 


RLL 


N 


1.5 










RUL 


N 


2.0 










LLL 


N 


2.5* 










LUL 


AGC 


2.0 


9 


65 


71 


SCC 


RLL 




2.0 










RUL 


SM 


2.5* 


10 


63 


45 


AC 


RLL 


N 


1.0 










RUL 


N 


1.8 










LLL 


N 


1.8 










LUL 


N 


1.3 


11 


61 


95 


AC 


LLL 


N 


2.5* 










LUL 


N 


2.8* 


12 


76 


17 


SCC 


RLL 


N 


2.0 










RUL 


N 


2.3* 










LLL 


N 


2.3* 










LUL 


N 


2.3* 



" RLL, right lower lobe; RUL, right upper lohe; LLL. left lower lobe; LUL, left 
upper lobe; AGC, atypical glandular cells; SM, squamous metaplasia; N, normal 
cells; AdSCC. adenosquamous carcinoma. 
b ~P~<T0~Q5 a.fcom~par«l to ncver-smbkef controls. 
c Resampled 4 months later. 



patients diagnosed with SCC, whereas only one of four patients 
with AC displayed trisomy 7 in any site collected at bronchos- 
copy. Case 7, which had histological features of both SCC and 
AC had one site positive for trisomy 7. The frequency of 
positive trisomy 7 sites in all patients with SCC within this 
small sample population was significantly greater than in AC 
patients (P < 0.005): 

The reproducibility of detecting trisomy 7 at sites found to 
be positive for this abnormality was investigated in one patient 
(case 1) who required repeat bronchoscopy for clinical reasons. 
Trisomy 7 was increased similarly in the two sites brushed 
during both procedures, although cytological cxaminadon 
showed atypical cells in one site from the first bronchoscopy 
and cytological ly normal cells from the same site collected 
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Table 2 Frequency of trisomy 7 in bronchia? epithelial cells from cancer-frcc 
smokers and former uranium miners 



Case Age 


Smoking 
(pack-yrs) 


Radon exposure 
(WLMsr 


Brush 
location 


Cytological 
diagnosis 


Trisomy 7 
(frequency, %) 


13 81 


15 


0 


RLL 


N 


1.8 








RUL 


AGC 










LLL 


N 


1.8 








LUL 


SM 


2.0 


14 34 


24 




RLL 


N 


1 1 
i.j 








RUL 


N 


JO 








LLL 


■v 


1 Af 








LL : L 


N 


■ i 


15 68 


5J 


q 


RLL 


IN 


4.1T 








AUL 


M 


* A/» 

O.v 








r t i 


yj 
iN 


4.3 










IN 


3.5 


16 45 


30 


u 


PI T 




1 J 








RUL 


N 


1.5 








J J J 


N 


2.0 








r T it 

LUL. 


N 


1.8 


1*7 so 


o 
o 


27 


LLL 


N 


3.0* 








LUL 


N 


3.0* 






5J6 


LUL 


N 


1.3 








RUL 


N 


3.3* 


19 64 


30 


235 


LUL 


N 


1.5 








RLL 


N 


1.0 


20 56 


0 


J86 


LUL 


N 


2.0 








RLL 


N 


23" 


21 64 


0 


214 


RLL 


H 


1.8 


22 64 


*> 


577 


LUL 


N 


1.8 








RLL 


H 


0.8 


23 67 


31 


124 


LLL 


H 


i.3 








LUL 


H 


2.R* 








RLL 


H 


2.5" 








RUL 


H 


33'' 



Table 3 Interphase analysis of crrromosornc 7 in norma] human bronchial 
epithelial cell* 



a Abbreviations arc as indicated in Table 1 foomotc. WLM. working level month: 
H. hyperplasia. 

* P < 0.05 as compared to never- smoker controls. 



during the second procedure (Table 1). The other two sites 
collected during the second bronchoscopy also showed elevated 
frequencies of trisomy 7 in this patient 

Trisomy 7 was detected in cytoiogicaliy normal bronchial 
epithelium collected from four sites in one (case 15) of the 
cancer-frcc smokers (Table 2). Bronchial cells from the other 
smokers did not contain this chromosome abnormality. In the 
former uranium miners (cases 17-23), seven of 15 sites col- 
lected during bronchoscopy were ; positive for trisomy 7. Three 

contained basal cell hyperplasia. However, the other four sam- 
ples positive for trisomy 7 showed no cytological abnormality. 

Two of the former uranium miners (cases 18 and 23) 
developed lung cancer within 2 years of bronchial ceil collec- 
tion. SCC was diagnosed in the right upper lobe of both sub- 
jects. As noted in Table 2, both cases were positive for trisomy 
7 in the right upper lobe brushing site obtained at the initial 
bronchoscopy. 

Tissue Culture Effects on Trisomy 7 Expression in Bron- 
diial Epithelium. The effect of tissue culture on trisomy 7 
frequency was assessed by comparing the frequency of this 
chromosome abnormality in freshly isolated bronchia! epithe- 
lium obtained directly from bronchial brushes ("preculture") to 
passage 1 cells. This comparison was conducted on ceils col- 
lected from two different bronchial sites in three different 
subjects [(cases 11 and 16 and donor 7 (never smoker)]. Cul- 
tured samples positive for trisomy 7 in case 11 were also 



Bronchial epithelial cell lines were established from never smokers (Clonetksi 
after antopsy and from volunteers. The normal distribution of chromosome 7 coov 
number as delected by KiSW is shown by the percentage of ceii* exhibiting 
3. or 4 hybridization signals. Four hundred cells containing hybridization 
were counted per donor. 



Donor 


Age 


Brush 
location 


i 


Numher of hybridization 
signals/ceU (%) 

2 3 




4 


1 


6 


KA° 


3.5 


92.0 


i.5 


3.0 


2 


17 


NA 


23 


95.5 


i.3 


1.0 


3 


15 


NA 


15 


94.7 


1.8 


2.0 


4 


41 


NA 


2.0 


94.8 


1.0 


2.3 


5 


45 


RLL 


1.0 


953 


1.8 


1.7 






RUL 


0.5 


98J 


t.O 


0.2 






LLL 


IJ 


96.5 


1.0 


1.2 






LUL 


1.0 


963 


1.2 


1.5 


6 


35 


RLL 


J.0 


96.8 


1.0 


1.2 






RUL 


2.5 


93.3 


1.7 


13 






LLL 


2.0 


94.8 


1.5 


1.7 






LUL 


1.8 


94.2 


i.8 


2.2 


7 


33 


RLL 


0.5 


98.2 


0.8 


0.5 






RUL 


0.5 


97.2 


1.3 


1.0 






LLL 


1.2 


96.8 


1.3 


0.7 






LUL 


1.0 


96.0 


13 


1.5 



rt Abbrcviatioas are as indicated io tbe legend to Table 1. NA. not applicable. 



positive in preculiure cells from die same bronchial collection 
site, whereas sites negative for trisomy 7 in cultured cells from 
case 16 and the never smoker were also negative in precutiarc 
cells (data not shown). Values for trisomy 7 differed by <0.3% 
between precuJturc and cultured cells. The effect of passaging 
cells on the frequency of trisomy 7 was also examined in 
bronchial cells from case 1. Trisomy 7 frequency was similar in 
cells from passages 1, 4, and 7. 

Frequency of Trisomy 2 in Nonmalignant Bronchial Epi* 
thelium. Aneuploidy has been detected in bronchial squamous 
metaplasia, a likely precursor to SCC (26). To determine 
whether the increased frequency of trisomy 7 detected in the 
current study reflects generalized aneuploidy or a specific chror 
mosomal duplication, a subgroup of samples was evaluated for 
trisomy of chromosome 2. The frequency of trisomy 2 in never 
smokers was 1 .5 i 0,4% (data not shown). Bronchial cells 
from eight subjects, six of whom had elevated frequencies for 
_ lris ? rn >' 7 * were evaluated . The f requenc y for trisomy of chro- 
rnosome 2 did not differ from never smokers (Table 4). 

Discussion 

The studies described in this report are the first to detect and 
quantify an increase in trisomy 7 in the airway cells of subjects 
at risk for lung cancer. The presence of trisomy 7 appeared to 
be a specific chromosome gain and not due to generalized 
aneuploidy in these cells. In addition, trisomy 7 in" nonmalig- 
nant epithelium from lung cancer patients was associated with 
SCC tumor histology, suggesting that patients with this genetic 
change may be at greater risk for developing SCC than ether 
histological forms of lung cancer. This supposition was sup- 
ported by the fact that two canccr-free former uranium miners 
with bronchial ceils positive for trisomy 7 ultimately developed 
SCC. Finally, these results demonstrate the abUity to detect 
genetic changes in cytoiogicaliy normal cells, suggesting that 
molecular analyses may enhance the power for detecting 
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Fig. A FISH for chromosome 7 iu I 
bronchial epithelial cells. Trisomy 7 
is apparent in one cell from this 
field. Magnification. X530. 




Table 4 frequency of trisomy 2 in bronchial epithelial cells from lung cancer 
patients, canccr-free smokers, a nd former uranium miners 



15 



19 



23 



Tumor diagnosis 


Brush location 


Trisomy 2 
(frequency, %) 


sec 


RLL° 


1.5 




KUL 


' s 




LLL 


1.8 




LUL 


1.0 


sec 


LLL 


1.0 




LUL 


J.O 


sec 


RU. 


1.5 




RUL 


2.1 




LLI. 


1.8 




LUL 


1.5 


AC 


R1.L 


0.3 




RUL 


1.5 




LLL 


0.8 


None 


RLL 


1.0 




RUL 


0.8 




LLL 


1.0 




— LUL- 


. J. 3 ... 








None 


RLL 


1.8 




RUL 


2.0 




LLL 


1.0 




I.UL . . 


1.3 


None 


LUL 


1.9 




RLL 


0.8 


None 


RLL 


1.5 



' Abbreviations are as indicated in legend to Table 1 . 



premaHgnam changes in bronchial . epithelium in high-risk 
individuals. 

Cigarette smoking and the exposure of underground min- 
ers to radon progeny arc both well-established respiratory car- 
cinogens (27, 28). Tobacco smoke contains numerous muta- 
gens and carcinogens, and radon progeny that have been 
inhaled and deposited on the respiratory epithelium release a 



particles capable of damaging DN A (28). Although comparison 
between findings in the cigarette smokers and the former ura- 
nium miners is constrained by the number of participants in the 
two groups, trisomy 7 was found in bolh groups. [These results 
are consistent with the synergism between smoking and radon 
progeny, which suggests commonality in the pathways by 
which the two carcinogens cause iung cancer (29). 

The bronchial brushing" method used for collecting cells 
from the lower respiratory tract is rapid (10-12 min total for 
two brushes at four different sites), well solerated by the patient, 
and permits collection of viable bronchial cells that can be 
expanded through tissue culture at 100% efficiency. The sta- 
bility of these cells in cukure was evident by the fact that the 
frequency of trisomy 7 did not differ between primary brush 
cells and cells propagated for up to seven passages. Further- 
more, this procedure is amenable to the production of sufficient 
cell numbers (1 X 10 8 ) at low passage (one or two) to accom- 
modate multiple molecular analyses. Although the media used 
in culturing of bronchial epithelial cells did not appear to 
"provfde^lielectiv^ 

additional chromosome 7. the modulation of medium supple- 
ments might lead to the establishment of clonal populations of 
premalignant cells. Such cell populations would greatly facil- 
itate the identification of additional early gene changes in 
respiratory carcinogenesis. 

The detection of trisomy 7 in multiple nonmalignant sites 
within the bronchial tree supports the theory of field cancer- 
ization (5), which stales that diffuse exposure of the entire 
respiratory tract to inhaled carcinogens causes the development 
of multiple, independently initiated sites lhai can lead to tumor 
development. Although the frequency of this chromosome ab- 
normality was relatively low (2.3-6.0%), these values were 
consistent with the low percentage of cells within each brash 
sample (10%) that exhibited abnormal cytology. These results 
are also similar to studies of chromosome gain in patients with 
head and neck cancer where trisomy 7 was detected at frequen- 
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cies of 2, 3 t and 2 1% in histologically normal, hyperplastic, and 
dysplastic cells respectively (30). 

The detection of trisomy 7 in normal, hyperplastic, and 
metaplastic bronchial epithelium from cancer-free patients ex- 
tends a recent report describing LOH at chromosomes 3p, 5q, 
and 9p in dysplastic prexnalignant bronchial lesions harvested 
from current and former smokers by bronchoscopy (33). The 
inability to detect LOH at these chromosome loci in normal or 
early premalignant epithelium may stem from a difference in 
sensitivity between the methodologies used The low frequency 
of trisomy 7 and cylologically abnormal cells collected from 
bronchoscopy is consistent with a lack of clonality within the 
brush cells. FISH assays on interphase cells permit screening of 
individual cells, and sensitivity for detection is limited only by 
the number of cells examined. In contrast, microsateilite anal- 
yses for LOH cannot detect nonclonal changes but require that 
the chromosome alteration be present in approximately 40- 
50% of the sample (32, 33). 

The role of trisomy 7 in lung cancer development has not 
been elucidated. Increased expression of EGFR, which is lo- 
cated on chromosome 7 (34), is observed in 50-80% of 
NSCLCs (16, 35, 36). EGFR expression appears greater in SCC 
than AC (35, 36) and is amplified in some cell lines derived 
from SCC (37). These findings corroborate our hypothesis that 
acquisition of trisomy 7 in bronchial epithelium could be prog- 
nostic for development of SCC. Moreover, expression of this 
gene is also increased in nonmalignant bronchial epithelium 
from NSCLC patients (16, 35) and in normal or premalignant 
epithelium adjacent to head and neck tumors (38). Thus, altered 
expression of EGFR could enable cells that have acquired 
additional genetic changes to proliferate continually and escape 
from terminal differentiation (39). In addition, the c~met onco- 
gene is also located on chromosome 7 and is overexpressed in 
NSCLCs (40, 41). This oncogene encodes a transmembrane 
tyrosine kinase (42) thai Junctions as a receptor for the hepa- 
tocyte growth factor (43) and is involved in sustaining the 
growth of NSCLC cells in culture (44). 

Previous studies have detected mutations in p53 (12, 14, 
35), chromosome losses at 9p21 (45) and 3p (46) in preinvasive 
bronchial lesions, and simple chromosome rearrangements in 
normal bronchial epithelium from proximal airways (47) of 
lung cancer patients. The prevalence of these genetic changes in 
normal epithelium from persons at risk for lung cancer should 
be quantified by FISH to define the temporal sequences of 
somatic genetic changes that precede the development of clonal 
lesions in the lung. This information will be invaluable in 
providing biological markers that can qualitatively estimate the 
- extent of tield cancerization-in-persons-at-risk-f or lung-cancer — 
and can be used to assess the efficacy of chemointervention 
trials. Ultimately, the efficiency for detecting these biological 
markers in bronchial epithelium versus exfoliated epithelial 
cells within sputum must be established to support the use of a 
"genetic-based" screening approach for individuals at high risk 
For lung cancer. The results of the current investigation have 
identified one potential biomarker, trisomy 7, that may be 
useful in early detection and intervention for lung carcino- 
genesis. 
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ABSTRACT Wnt family members are critical to many 
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identification 
of two genes, WJSP-1 and WISP-2, that are up-regulated in the 
mouse mammary epithelial cell line C57MG transformed by 
Wnt-1, but not by Wnt-4. Together with a third related gene, 
WISP-3, these proteins define a subfamily of the connective 
tissue growth factor family. Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1. These included (i) C57MG cells infected with a Wnt-1 
retroviral vector or expressing Wnt-1 under the control of a 
tetracyline repressible promoter, and (ii) Wnt-1 transgenic 
mice. The WISP-1 gene was localized to human chromosome 
8q24.1-8q24«3. WISP-1 genomic DNA was amplified in colon 
cancer cell lines and in human colon tumors and its RNA 
overexpressed (2- to > 30-fold) in 84% of the tumors examined 
compared with patient-matched normal mucosa. WISPS 
mapped to chromosome 6q22-6q23 and also was overex- 
pressed (4- to >40-fold) in 63% of the colon tumors analyzed. 
In contrast, WISP-2 mapped to human chromosome 20ql2- 
20ql3 and its DNA was amplified, but RNA expression was 
reduced (2- to >30-fold) in 79% of the tumors. These results 
suggest that the WISP genes may be downstream of Wnt-1 
signaling and that aberrant levels of WISP expression in colon 
cancer may play a role in colon tumorigenesis. 



Wnt-1 is a member of an expanding family of cysteine-rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven-transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsh) to the cell membrane (1, 2, 6). Dsh then inhibits the 
kinase activity of the normally constitutively active glycogen 
synthase kinase-3/3 (GSK-3/3) resulting in an increase in 
j3-catenin levels. Stabilized j3-catenin interacts with the tran- 
scription factor TCF/Lefl, forming a complex that appears in 
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the nucleus and binds TCF/Lefl target DNA elements to 
activate transcription (7, 8). Other experiments suggest that 
the adenomatous polyposis coli (APC) tumor suppressor gene 
also plays an important role in Wnt signaling by regulating 
j3-catenin levels (9). APC is phosphorylated by GSK-3/3, binds 
to /3-catenin, and facilitates its degradation. Mutations in 
either APC or /3-catenin have been associated with colon 
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1). 

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3, a member of 
the transforming growth factor (TGF)-/3 superfamily, and the 
homeobox genes, engrailed, goosecoid, twin (Xtwri), and siamois 
(2). A recent report also identifies c-myc as a target gene of the 
Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signal- 
ing pathway that are relevant to the transformed cell pheno- 
type, we used a PCR-based cDNA subtraction strategy, sup- 
pression subtractive hybridization (SSH) (11), using RNA 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this cell line is sufficient to induce a 
partially transformed phenotype, characterized by elongated 
and refractile cells that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two cell lines might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WISP-1 
and WISP-2, and a third related gene, WISP-3. The WISP genes 
are members of the CCN family of growth factors, which 
includes connective tissue growth factor (CTGF), Cyr61, and 
nov, a family not previously linked to Wnt signaling. 

MATERIALS AND METHODS 

SSH. SSH was performed by using the PCR-Select cDNA 
Subtraction Kit (CLONTECH). Tester double-stranded 

Abbreviations: TGF, transforming growth factor; CTGF, connective 
tissue growth factor; SSH, suppression subtractive hybridization; 
VWC, von Willebrand factor type C module. 
Data deposition: The sequences reported in this paper have been 
deposited in the Genbank database (accession nos. AF1 00777, 
AF100778, AF100779, AF100780, and AF100781). 
tTo whom reprint requests should be addressed, e-mail: diane@gene. 
com. 
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cDNA was synthesized from 2 of poly(A) + RNA isolated 
from the C57MG/Wnt-1 cell line and driver cDNA from 2 /xg 
of poly(A) + RNA from the parent C57MG cells. The sub- 
tracted cDNA library was subcloned into a pGEM-T vector for 
further analysis. 

cDNA Library Screening. Clones encoding full-length 
mouse WISP-1 were isolated by screening a AgtlO mouse 
embryo cDNA library (CLONTECH) with a 70-bp probe from 
the original partial clone 568 sequence corresponding to amino 
acids 128-169. Clones encoding full-length human WISP-1 
were isolated by screening AgtlO lung and fetal kidney cDNA 
libraries with the same probe at low stringency. Clones en- 
coding full-length mouse and human WISP-2 were isolated by 
screening a C57MG/Wnt-1 or human fetal lung cDNA library 
with a probe corresponding to nucleotides 1463-1512. Full- 
length cDNAs encoding WISPS were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA. PCR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 /xM of each dNTP at 
94°C for 1 sec, 62°C for 30 sec, 72°C for 1 min, for 22-32 cycles. 
WISP and glyceraldehyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Situ Hybridization. 33 P-labeled sense and antisense ribo- 
probes were transcribed from an 897-bp PCR product corre- 
sponding to nucleotides 601-1440 of mouse WISP-1 or a 
294-bp PCR product corresponding to nucleotides 82-375 of 
mouse WISP-2. All tissues were processed as described (40). 

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-amplified, and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation f luorimetry. Total RNA was prepared 
by homogenization in 7 M GuSCN followed by centrifugation 
over CsCl cushions or prepared by using RNAzol. 

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR. Gene-specific primers and 
fluorogenic probes (sequences available on request) were 
designed and used to amplify and quantitate the genes. The 
relative gene copy number was derived by using the formula 
2< Act > where ACt represents the difference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphocyte DNA compared with colon tumor DNA or colon 
tumor RNA compared with normal mucosal RNA. The 
d-method was used for calculation of the SE of the gene copy 
number or RNA expression level. The W/Sf-specific signal was 
normalized to that of the gIyceraldehyde-3-phosphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-Elmer Applied Biosystems. 

RESULTS 

Isolation of WISP-1 and WISP-2 by SSH. To identify Wnt- 
1-inducible genes, we used the technique of SSH using the 



mouse mammary epithelial cell line C57MG and C57MG cells 
that stably express Wnt-1 (11). Candidate differentially ex- 
pressed cDNAs (1,384 total) were sequenced. Thirty-nine 
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no match. To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/Wnt-1 cells. 

Two of the cDNAs, WISP-1 and WISP-2, were differentially 
expressed, being induced in the C57MG/Wnt-1 cell line, but 
not in the parent C57MG cells or C57MG cells overexpressing 
Wnt-4 (Fig. I A and B). Wnt-4, unlike Wnt-1, does not induce 
the morphological transformation of C57MG cells and has no 
effect on /3-catenin levels (13, 14). Expression of WISP-1 was 
up-regulated approximately 3-fold in the C57MG/Wnt-1 cell 
line and WISP-2 by approximately 5-fold by both Northern 
analysis and reverse transcription-PCR. 

An independent, but similar, system was used to examine 
WISP expression after Wnt-1 induction. C57MG cells express- 
ing the Wnt-1 gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-1 mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnt-1 mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISPs may be an indirect response, to Wnt-1 
signaling. 

cDNA clones of human WISP-1 were isolated and the 
sequence compared with mouse WISP-1. The cDNA sequences 
of mouse and human WISP-1 were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of ^40,000 (M t 40 K). Both have 
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosylation sites 
and are 84% identical (Fig. 24). 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of 27,000 (Af r 27 K) (Fig. IB). Mouse and human 
WISP-2 are 73% identical. Human WISP-2 has no potential 
N-linked glycosylation sites, and mouse WISP-2 has one at 
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Fig. 1. WISP-1 and WISP-2 are induced by Wnt-1, but not Wnt-4, 
expression in C57MG cells. Northern analysis of WISP-1 (A) and 
WISP-2 (B) expression in C57MG, C57MG/Wnt-1, and C57MG/ 
Wnt-4 cells. Po!y(A) + RNA (2 fig) was subjected to Northern blot 
analysis and hybridized with a 70-bp mouse W^SP-V-specific probe 
(amino acids 278-300) or a 190-bp WIS P-2-specific probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were rehybridized with 
human 0-actin probe. 
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Fig. 2. Encoded amino acid sequence alignment of mouse and 
human WISP-1 (A) and mouse and human WISP-2 (B). The potential 
signal sequence, insulin-like growth factor-binding protein (IGF-BP), 
VWC, thrombospondin (TSP), and C-terminal (CT) domains are 
underlined. 

position 197. WISP-2 has 28 cysteine residues that are con- 
served among the 38 cysteines found in WISP-1. 

Identification of WISPS. To search for related proteins, we 
screened expressed sequence tag (EST) databases with the 
WISP-1 protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3. A full-length human 
WISP-3 cDNA of 1,371 bp was isolated corresponding to those 
ESTs that encode a 354-aa protein with a predicted molecular 
mass of 39,293. WISP-3 has two potential N-linked glycosyl- 
ation sites and 36 cysteine residues. An alignment of the three 
human WISP proteins shows that WISP-1 and WISP-3 are the 
most similar (42% identity), whereas WISP-2 has 37% identity 
with WISP-1 and 32% identity with WISP-3 (Fig. 3-4). 

WISPs Are Homologous to the CTGF Family of Proteins. 
Human WISP-1, WISP-2, and WISP-3 are novel sequences; 
however, mouse WISP-1 is the same as the recently identified 
Elml gene. Elml is expressed in low, but not high, metastatic 
mouse melanoma cells, and suppresses the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells (15). 
Human and mouse WISP-2 are homologous to the recently 
described rat gene, rCop-1 (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov. CTGF is a chemotactic and mitogen ic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced by TGF-j3 (17). Cyr61 is an extracel- 
lular matrix signaling molecule that promotes cell adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overexpressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, similarity to Wnt-1. All are 
secreted, cysteine-rich heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WISP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 3B) (21). The N-terminal domain, which includes the first 
12 cysteine residues, contains a consensus sequence (GCGC- 
CXXC) conserved in most insulin-like growth factor (IGF)- 
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Fig. 3.. (^4) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WISP-1 and WISP-2 that are not 
present in WISP-3 are indicated with a dot. (B) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical lines). The four cysteine residues in the VWC domain 
that are absent in WISP-3 are indicated with a dot. (C) Expression of 
WISP mRNA in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetal tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3, whereas WISP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagens and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 3 A and B). 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated glycoconjugates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerization and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 3 A and 5). The existence of a putative 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown). 

Expression of WISP mRNA in Human Tissues. Tissue- 
specific expression of human WISPs was characterized by PCR 
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analysis on adult and fetal multiple tissue cDNA panels. 
WISP-1 expression was seen in the adult heart, kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Little or no expression was detected in the brain, liver, skeletal 
muscle, colon, peripheral blood leukocytes, prostate, testis, or 
thymus. WISP-2 had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of WISP-3 was seen in adult 
kidney and testis and fetal kidney. Lower levels of WISP-3 
expression were detected in placenta, ovary, prostate, and 
small intestine. 

In Situ Localization of WISP-1 and WISP-2. Expression of 
WISP-1 and WISP-2 was assessed by in situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WISP-1 was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig. 4 A-D). However, low- 
level WISP-1 expression also was observed focally within tumor 
cells (data not shown). No expression was observed in normal 
breast. Like WISP-1, WISP-2 expression also was seen in the 
tumor stroma in breast tumors from Wnt-1 transgenic animals 
(Fig. 4 E-H). However, WISP-2 expression in the stroma was 
in spindle-shaped cells adjacent to capillary vessels, whereas 




Fig. 4. (A, C,£, and G) Representative hematoxylin/eosin-stained 
images from breast tumors in Wnt-1 transgenic mice. The correspond- 
ing dark -field images showing WISP-1 expression are shown in B and 
D. The tumor is a moderately well-differentiated adenocarcinoma 
showing evidence of adenoid cystic change. At low power {A and B), 
expression of WISP-1 is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stromal(s) fibroblasts (C and D), and tumor cells are negative. 
Focal expression of WISP-1, however, was observed in tumor cells in 
some areas. Images of WISP-2 expression are shown in E-H. At low 
power (E and F), expression of WISP-2 is seen in cells lying within the 
fibrovascular tumor stroma. At higher magnification, these cells 
appeared to be adjacent to capillary vessels whereas tumor cells are 
negative (G and H). 
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the predominant cell type expressing WISP-1 was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP-1 is approximately 
3.48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 16.31] on chromosome 8q24.1 to 8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28). Preliminary fine 
mapping indicates that WISP-1 is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod = 1,000) on 
chromosome 20ql2-20ql3.1. Human WISP-3 mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM211ze5 
(lod = 1,000). WISP-3 is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (27, 29). 

Amplification and Aberrant Expression of WISPs in Human 
Colon Tumors, Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression 
and poor prognosis (30). Because WISP-1 resides in the same 
general chromosomal location (8q24) as c-myc, we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus. 
Genomic DNA from human colon cancer cell lines was 
assessed by quantitative PCR and Southern blot analysis. (Fig. 
5 A and B). Both methods detected similar degrees of WISP-1 
amplification. Most cell lines showed significant (2- to 4-fold) 
amplification, with the HT-29 and WiDr cell lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP-1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PCR (Fig. 6). The copy number of WISP-1 and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-1 in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors (P < 0.001 for each). The 
copy number for WISP-3 was indistinguishable from one (P = 
0.166). In addition, the copy number of WISP-2 was signifi- 
cantly higher than that of WISP-1 (P < 0.001). 

The levels of WISP transcripts in RNA isolated from 19 
adenocarcinomas and their matched normal mucosa were 




Fig. 5. Amplification of WISP-1 genomic DNA in colon cancer cell 
lines. (A) Amplification in cell line DNA was determined by quanti- 
tative PCR. (B) Southern blots containing genomic DNA (10 jig) 
digested with EcoKl (WISP-1) or Xbal (c-myc) were hybridized with 
a 100-bp human WISP-1 probe (amino acids 186-219) or a human 
c-myc probe (located at bp 1901-2000). The WISP and myc genes are 
detected in normal human genomic DNA after a longer film exposure. 
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Fig. 6. Genomic amplification of WISP genes in human colon 
tumors. The relative gene copy number of the WISP genes in 25 
adenocarcinomas was assayed by quantitative PCR, by comparing 
DNA from primary human tumors with pooled DNA from 10 healthy 
donors. The data are means ± SEM from one experiment done in 
triplicate. The experiment was repeated at least three times. 

assessed by quantitative PCR (Fig. 7). The level of WISP-1 
RNA present in tumor tissue varied but was significantly 
increased (2- to >25-fold) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 10-fold overexpression. 
In contrast, in 79% (15/19) of the tumors examined, WISP-2 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WISP-1, WISPS RNA was overexpressed in 
63% (12/19) of the colon tumors compared with the normal 
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Fig. 7. WISP RNA expression in primary human colon tumors 
relative to expression in normal mucosa from the same patient. 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative PCR. The Dukes stage of the tumor is listed under the 
sample number. The data are means ± SEM from one experiment 
done in triplicate. The experiment was repeated at least twice. 
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mucosa. The amount of overexpression of WISPS ranged from 
4- to >40-fold. 

DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy, SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
Wnt-1. 

Three of the genes isolated, WISP-1, WISP-2, and WISP-3, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and nov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-1. 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of 
a tetracyline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnt-1 and WISPs in that in these two situations, 
WISP induction was correlated with Wnt-1 expression. 

It is not clear whether the WISPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., 0-catenin-TCF-l/Lefl). The increased levels of 
WISP RNA were measured in Wnt-l-transformed cells, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through )3-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present in CTGF, Cyr61, nov, WISP-1, and WISP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such as TGF-0, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WISP-1 and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or nov. A recent report has shown that integrin a v fo serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression of WISP-1 and WISP-2 in cells lying 
within the fibrovascular tumor stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
tissue formation during wound repair. It has been proposed 
that mammary tumor cells or inflammatory cells at the tumor 
interstitial interface secrete TGF-/31, which is the stimulus for 
stromal proliferation (34). TGF-j31 is secreted by a large 
percentage of malignant breast tumors and may be one of the 
growth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest that WISP-1 and WISP-2 expression was 
observed in the stromal cells that surrounded the tumor cells 
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(epithelial cells) in the Wnt-1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply WISP-1 and 
WISP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP-1 
and WISP-2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysis of WISP-1 gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas overexpression of 
WISPS RNA was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WISP-2 was localized to chromosome 20ql2-20ql3, at a region 
frequently amplified and associated with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20ql3 amplicon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon. 

A recent manuscript on rCop-1, the rat orthologue of 
WISP-2, describes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down-regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and )3-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic j3-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, the 
identification of WISPs as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development. 
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1 Introduction 

A proteome has been defined as the protein complement 
expressed by the genome of an organism, or, in multicel- 
lular organisms, as the protein complement expressed by a 
tissue or differentiated cell [1]. In the most common im- 
plementation of proteome analysis the proteins extracted 
from the cell or tissue analyzed are separated by high 
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Biotechnology; University of Washington, Box 357730, Seattle, WA, 
9819S, USA (Tel: +20^685-4235; Fax: +206-685-6392; E-mail: ruedi 
9u.washingtoQ.edu) 

Abbreviations: CID, collision-induced dissociation; MS/MS, tandem 
mass, spectrometry; SAGE, serial analysis of gene expression 

Keywords: Proteome / TWo-dimensional polyacrylamide gel electro- 
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2 Rationale for proteome analysis 

The dramatic growth in both the number of genome 
projects and the speed with which genome sequences 
are being determined has generated huge amounts of 
sequence information, for some species even complete 
genomic sequences ([15-17]). The description of the 
state of a biological system by the quantitative measure- 
ment of system components has long been a primary 
objective in molecular biology. With recent technical 
advances including the development of differential. dis- 
play-PCR [18], cDNA microarray and DNA chip techno- 
logy [19, 20] and serial analysis of gene expression 
(SAGE) [21, 22], it is now feasible to establish global and 
quantitative mRNA expression maps of cells and tissues, 
in which the sequence of all the genes is known, at a 
speed and: sensitivity which is not matched by current 
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protein analysis technology. Given the long-standing 
paradigm in biology that DNA synthesizes RNA which 
synthesizes protein, and the ability to rapidly establish 
global, quantitative mRNA expression maps, the ques- 
tions which arise are why technically complex proteome 
projects should be undertaken and what specific types of 
information could be expected from proteome projects 
which cannot be obtained from genomic and transcript 
profiling projects. We see three main reasons for pro- 
teome analysis to become an essential component in the 
comprehensive analysis of biological systems, (i) Protein 
expression levels are not predictable from the mRNA 
expression levels, (ii) proteins are dynamically modified 
and processed in ways which are not necessarily 
apparent from the gene sequence, and (iii) proteomes 
are dynamic and reflect the state of a biological system. 

2.1 Correlation between mRNA and protein expression 
levels 

Interpretations of quantitative mRNA expression profiles 
frequently implicitly or explicitly assume that for specific 
genes the transcript levels are indicative of the levels of 
protein expression. As part of an ongoing study in our 
laboratory, we have determined the correlation of expres- 
sion at the mRNA and protein levels for a population of 
selected genes in the yeast Saccharomyces cerevisiae 
growing at mid-log phase (S. P. Gygi et a/., submitted for 
publication). mRNA expression levels were calculated 
from published SAGE frequency tables [22]. Protein 
expression levels were quantified by metabolic radiola- 
beling of the yeast proteins, liquid scintillation counting 
of the protein spots, separated by high resolution 2-DE 
and mass spectrometric identification of the protein(s) 
migrating to each spot. The selected 80 samples consti- 
tute a relatively homogeneous group with respect to pre- 
dicted half-life and expression level of the protein pro- 
ducts. Thus far, we have found a general trend but no 
strong correlation between protein and transcript levels 
(Fig. 1). For some genes studied equivalent mRNA trans- 
cript levels translated into protein abundances which 
varied by more than 50-fold. Similarly, equivalent steady- 
state protein expression levels were maintained by trans- 
cript levels varying by as much as 40-fold (S. P. Gygi 
et a/., submitted). These results suggests that even for a 
population of genes predicted to be relatively homoge- 
neous with respect to protein half-life and gene expres- 
sion, the protein levels cannot be accurately predicted 
from the level of the corresponding mRNA transcript. 

2.2 Proteins are dynamically modified and processed 

In the mature, biologically active form many proteins are 
post-translationally modified by glycosylation, phosphor- 
ylation, prenylation, acylation, ubiquitination or one or 
more of many other modifications [23] and many pro- 
teins are only functional if specifically associated or com- 
plexed with other molecules, including DNA, RNA, pro- 
teins and organic and inorganic cofactors. Frequently, 
modifications are dynamic and reversible and may alter 
the precise three-dimensional structure and the state of 
activity of a protein. Collectively, the state of modifica- 
tion of the proteins which constitute a biological system 
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Figure I. Correlation between mRNA and protein levels in yeast celts. 
For a selected population of 80 genes, protein levels were measured 
by 35 -S-radiolabeling and mRNA levels were calculated from publi- 
shed SAGE tables. Inset: expanded view of the low abundance region. 
For more experimental details, also see Figs. 5 and 6, (S. P. Gygi et al. t 
submitted). 

are important indicators for the state of the system. The 
type of protein modification and the sites modified at a 
specific cellular state can usually not be determined 
from the gene sequence alone. 



23 Proteomes are dynamic and reflect the state of a 
biological system 

A single genome can give rise to many qualitatively and 
quantitatively different proteomes. Specific stages of the 
cell cycle and states of differentiation, responses to 
growth and nutrient conditions, temperature and stress, 
and pathological conditions represent cellular states 
which are characterized by significantly different pro- 
teomes. The proteome, in principle, also reflects events 
that are under translation^ and post-translational con- 
trol. It is therefore expected that proteomics will be able 
to provide the most precise and detailed molecular des- 
cription of the state of a cell or tissue, provided that the 
external conditions defining the state are carefully deter- 
mined. In answer to the question of whether the study 
of proteomes is necessary for the analysis of biomolec- 
ular systems, it is evident that the analysis of mature pro- 
tein products in cells is essential as there are numerous 
levels of control of protein synthesis, degradation^ 
processing and modification, which are only apparent by 
direct protein analysis. 



3 Description and assessment of current proteome 
analysis technology 

3.1 Technical requirements of proteome technology 

In biological systems the level of expression as well as 
the states of modification, processing and macro-molec- 
ular association of proteins are controlled and modu- 
lated depending on the state of the system. Comprehen- 
sive analysis of the identity, quantity and state of modifi- 
cation of proteins therefore requires the detection and 
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quantitation of the proteins which constitute the system, 
and analysis of differentially processed forms. There are 
a number of inherent difficulties in protein analysis 
which complicate these tasks. First, proteins cannot be 
amplified. It is possible to produce large amounts of a 
particular protein by over-expression in specific cell sys- 
tems. However, since many proteins are dynamically 
post-translationally modified, they cannot be easily am- 
plified in the form in which they finally function in the 
biological system. It is frequently difficult to purify from 
the native source sufficient amounts of a protein for 
analysis. From a technological point of view this trans- 
lates into the need for high sensitivity analytical tech- 
niques. Second, many proteins are modified and pro- 
cessed post-translationally. Therefore, in addition to the 
protein identity, the structural basis for differentially 
modified isoforms also needs to be determined. The dis- 
tribution of a constant amount of protein over several 
differentially modified isoforms further reduces the 
amount of each species, available for analysis. The com- 
plexity and dynamics of post-translational protein edit- 
ing thus significantly complicates proteome studies. 
Third, proteins vary dramatically with respect to their 
solubility in commonly used solvents. There are few, if 
any, solvent conditions in which all proteins are soluble 
and which are also compatible with protein analysis. This 
makes the development of protein purification methods 
particularly difficult since both protein purification and 
solubility have to be achieved under the same condi- 
tions. Detergents, in particular sodium dodecyl sulfate 
(SDS), are frequently added to aqueous solvents to 
maintain protein solubility. The compatibility with SDS 
is a big advantage of SDS poiyacrylamide gel electro- 
phoresis (SDS-PAGE) over other protein separation 
"techniques. Thus, SDS-PAGE and two-dimensional gel 
electrophoresis, which also uses SDS and other deter- 
gents, are the most general and preferred methods for 
the purification of small amounts of proteins, provided 
that activity does not necessarily need to be maintained. 
Lastly, the number of proteins in a given cell system is 
typically in the thousands. Any attempt to identify and 
categorize all of these must use methods which are as 
rapid as possible to allow completion of the project 
within a reasonable time frame. Therefore, a successful, 
general proteomics technology requires high sensitivity, 
high throughput, the ability to differentiate differentially 
modified proteins, and the ability to quantitatively dis- 
play and analyze all the proteins present in a sample. 

32 2-D electrophoresis — mass spectrometry: a common 
implementation of proteome analysis 

The most common currently used implementation of 
proteome analysis technology is based on the separation 
of proteins by two-dimensional (IEF/SDS-PAGE) gel 
electrophoresis and their subsequent identification and 
analysis by mass spectrometry (MS) or tandem mass 
spectrometry (MS/MS). In 2-DE, proteins are first separ- 
ated by isoelectric focusing (IEF) and then by SDS- 
PAGE, in the second, perpendicular dimension. Separ- 
ated proteins are visualized at high sensitivity by staining 
or autoradiography, producing two-dimensional arrays of 
proteins. 2-DE gels are, at present, the most commonly 
used means of global display of proteins in complex 



samples. The separation of thousands of proteins has 
been achieved in a single gel [24, 25] and differentially 
modified proteins are frequently separated. Due to the 
compatibility of 2-DE with high concentrations of deter- 
gents, protein denaturants and other additives promoting 
protein solubility, the technique is widely used. 

The second step of this type of proteome analysis is the 
identification and analysis of separated proteins. Individ- 
ual proteins from poiyacrylamide gels have traditionally 
been identified using TV-terminal sequencing [26, 27], 
internal peptide sequencing (28, 29], immunoblotting or 
comigration with known proteins [30]. The recent dra- 
matic growth of large-scale genomic and expressed 
sequence tag (EST) sequence databases has resulted in a 
fundamental change in the way proteins are identified by 
their amino acid sequence. Rather than by the traditional 
methods described above, protein sequences are now fre- 
quently determined by correlating mass spectral or 
tandem mass spectral data of peptides derived from pro- 
teins, with the information contained in sequence data- 
bases [31-33]. 

There are a number of alternative approaches to pro- 
teome analysis currently under development. There is 
considerable interest in developing a proteome analysis 
stragegy which bypasses 2-DE altogether, because it is 
considered a relatively slow and tedious process, and 
because of perceived difficulties in extracting proteins, 
from the gel matrix for analysis. However, 2-DE as a 
starting point for proteome analysis has many advan- 
tages compared to other techniques available today. The 
most significant strengths of the 2-DE-MS approach 
include the relatively uniform behavior of proteins in 
gels, the ability to quantify spots and the high resolution 
and simultaneous display of hundreds to thousands of 
proteins within a reasonable time frame. 

A schematic diagram of a typical procedure of the identi- 
fication of gel-separated proteins is shown in Fig. 2. Pro- 
tein spots detected in the gel are enzymaticaliy or chemi- 
cally fragmented and the peptide fragments are isolated 
for analysis, as already indicated, most frequently by MS 
or MS/MS. There are numerous protocols for the gener- 
ation of peptide fragments from gel-separated proteins. 
They can be grouped into two categories, digestion in 
the gel slice [28, 34] or digestion after electrotransfer out 
of the gel onto a suitable membrane ([29, 35-37] and 
reviewed in [38]). In most instances either technique is 
applicable and yields good results. The analysis of MS or 
MS/MS data is an important step in the whole process 
because MS instruments can generate an enormous 
amount of information which cannot easily be managed 
manually. Recently, a number of groups have developed 
software systems dedicated to the use of peptide MS 
and MS/MS spectra for the identification of proteins. 
Proteins are identified, by correlating the information 
contained in the MS spectra of protein digests or 
MS/MS spectra of individual peptides with data con- 
tained in DNA or protein sequence databases. 

The systems we are currently using in our laboratory are 
based on the separation of the peptides contained in pro- 
tein digests by narrow bore or capillary liquid chromatog- 
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Figure 2. Schematic diagram of a procedure for tdentificaUon of gel- 
separated proteins. Peptides can either be separated by a technique 
such as LC or CE, or infused as a mixture and sorted in the MS. Data- 
base searching can either be performed on peptide masses from an 
MS spectrum, peptide fragment masses from CID spectra of peptides, 
or a combination of both. 

raphy [39, 40] or capillary electrophoresis [41], the anal- 
ysis of the separated peptides by electrospray ioniza- 
tion (ESI) MS/MS, and the correlation of the generated 
peptide spectra with sequence databases using the 
SEQUEST program developed at the University of Wash- 
ington [32, 33]. The system automatically performs the 
following operations: a particular peptide ion character- 
ized by its mass-to-charge ratio is selected in the MS out 
of all the peptide ions present in the system at a parti- 
cular time; the selected peptide ion is collided in a colli- 
sion cell with argon (collision-induced dissociation, 
CID) and the masses of the resulting fragment ions are 
determined in the second sector of the tandem MS; this 
experimentally determined CID spectrum is then corre- 
lated with the CID spectra predicted from all the pep- 
tides in a sequence database which have essentially the 
same mass as the peptide selected for CID; this correla- 
tion matches the isolated peptide with a sequence seg- 
ment in a database and thus identifies the protein from 
which the peptide was derived. There are a number of 
alternative programs which use peptide CID spectra for 
protein identification, but we use the SEQUEST system 
because it is currently the most highly automated pro- 
gram and has proven to be successful, versatile and 
robust. 

33 Protein identification by LC-MS/MS, capillary 
LC-MS/MS and CE-MS/MS 

It has been demonstrated repeatedly that MS has a very 
high intrinsic sensitivity. For the routine analysis of gel- 
separated proteins at high sensitivity, the most signif- 
icant challenge is the handling of small amounts of 
sample. The crux of the problem is the extraction and 
transferal of peptide mixtures generated by the digestion 
of low nanogram amounts of protein, from gels into the 
MS/MS system without significant loss of sample or 
introduction of unwanted contaminants. We employ 
three different systems for introducing gel-purified sam- 
ples into an MS, depending on the level of sensitivity 
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required. As an approximate guideline, for samples con- 
taining tens of picomoles of peptides, LC-MS/MS is 
most appropriate; for samples containing low picomole 
amounts to high femtomole amounts we use capillary 
LC-MS/MS; and for samples containing femtomoles or 
less, CE-MS/MS is the method of choice. 

33.1 LC-MS/MS 

The coupling of an MS to an HPLC system using a 
0.5 mm diameter or bigger reverse phase (RP) column 
has been described in detail [42]. This system has several 
advantages if a large number of samples are to be ana- 
lyzed and all are available in sufficient quantity. The 
LC-MS and database searching program can be run in a 
fully automated mode using an autosampler, thus maxi- 
mizing sample throughput and minimizing the need for 
operator interference. The relatively large column is 
tolerant of high levels of impurities from either gel prep- 
aration or sample matrix. Lastly, if configured with a 
flow-splitter and micro-sprayer [40], analyses can be per- 
formed on a small fraction of the sample (less than 5%) 
while the remainder of the sample is recovered in very 
pure solvents. This latter feature is particularly useful 
when an orthogonal technique is also used to analyze 
peptide fractions, such as scintillation of an introduced 
radiolabel, and this data can be correlated with peptides 
identified by CID spectra. 

332 Capillary LC-MS 

An increase of sensitivity of approximately tenfold can be 
achieved by using a capillary LC system with a 100 \im ID 
column rather than a 0.5 mm ID column as referred to 
above. Since very low flow rates are required for such 
columns, most reports have used a precolumn flow split- 
ting system for producing solvent gradients. We have 
recently desribed the design and construction of a novel 
gradient mixing system which enables the formation 
of reproducible gradients at very low flow rates (low 
nL/min) without the need for flow splitting (A. Ducret 
et a/., submitted for publication). Using this capillary 
LC-MS/MS system we were able to identify gel-separat- 
ed proteins if low picomole to high femtomole amounts 
were loaded onto the gel [40]. This system is as yet not 
automated and, like all capillary LC systems, is prone to 
blockage of the columns by microparticulates when ana- 
lyzing gel-separated proteins. 

333 CE-MS/MS 

The highest level of sensitivity for analyzing gel-sep- 
arated proteins can be achieved by using capillary elec- 
trophoresis - mass spectrometry (CE-MS). We have de- 
scribed in the past a solid-phase extraction capillary elec- 
trophoresis (SPE-CE) system which was used with triple 
quadrupoie and ion trap ESI-MS/MS systems for the 
identification of proteins at the low femtomole to sub- 
femtomole sensitivity level [43, 44]. While this system is 
highly sensitive, its operation is labor-intensive and its 
operation has not been automated. In order to devise an 
analytical system with both the sensitivity of a CE and 
the level of automation of LC, we have constructed 
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microfabricated devices for the introduction of samples 
into ESI-MS for high-sensitivity peptide analysis. 

The basic device is a piece of glass into which channels 
of 10-30 um in depth and 50-70 \im in diameter are 
etched by using photolithography/etching techniques 
similar to the ones used in the semiconductor industry. 
(A simple device is shown in Fig. 3). The channels are 
connected to an external high voltage power supply [45], 
Samples are manipulated on the device and off the 
device to the MS by applying different potentials to the 
reservoirs. This creates a solvent flow by electroosmotic 
pumping which can be redirected by changing the posi- 
tion of the electrode. Therefore, without the need for 
valves or gates and without any external pumping, the 
flow can be redirected by simply switching the position 
of the electrodes on the device. The direction and rate of 
the flow can be modulated by the size and the polarity 
of the electric field applied and also by the charge state 
of the surface. 

The type of data generated by the system is illustrated in 
Fig. 4, which shows the mass spectrum of a peptide sample 
representing the tryptic digest of carbonic anhydrase at 
290 fmol/uL. Each numbered peak indicates a peptide suc- 
cessfully identified as being derived from carbonic an- 
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Figure 3. Schematic illustration of a 
microfabricated analytical system for CE, 
.consisting of a micromachined device, 
coated capillary electroosmotic pump, 
and microeleclrospray interface. The 
dimensions of the channels and reservoir 
arc as indicated in the text. The channels 
on the device were graphically enhanced 
to make them more visible. Reproduced 
from [45], with permission. 



hydrase. Some of the unassigned signals may be chemical - 
or peptide contaminants. The MS is programmed to auto- 
matically select each peak and subject the peptide to CID. 
The resulting CID spectra are then used to identify the 
protein by correlation with sequence databases. Therefore, 
this system allows us to concurrently apply a number of 
protein digests onto the device, to sequentially mobilize 
the samples, to automatically generate CID spectra of 
selected peptide ions and to search ' sequence databases 
for protein identification. These steps are performed auto- 
matically without the need for user input and proteins can 
be identified at very low femtomole level sensitivity at a 5 
rate of approximately one protein per 15 min. ;; 

3.4 Assessment of 2-DE-MS proteome technology 

Using a combination of the analytical techniques de- 
scribed above we have identified the 80 protein spots 
indicated in Pig. 5. The protein pattern was generated by 
separating a total of 40 microgram of protein contained 
in a total cell lysate of the yeast strain YPH499 by high 
resolution 2-DE and silver staining of the separated pro- 
teins. To estimate how far this type of proteome analysis 
can penetrate towards the identification of low abun- 
dance proteins, we have calculated the codon bias of the 
genes encoding the respective proteins.iCodon bias is a 
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Figure 4. MS spectrum of a tryptic digest 
of carbonic anhydrase using the microfa- 
bricated system shown in Fig. 3. 290 
fmol/uL of carbonic anhydrase tryptic 
digest was infused into a Finnigan LCQ 
ion trap MS. Each peak was selected for 
CID, and those which were identified as 
containing peptides derived from car- 
bonic anhydrase are numbered. Repro- 
duced from (45), with permission. 
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T^^^ IS™ ■ WW SDS-PAGE gel. Proteins were visualized by silver staining. Further details of experimental 
procedures are included in S. P. Gygi et at. (submitted). 



calculated measure of the degree of redundancy of trip- 
let DNA codons used to produce each amino acid in a 
particular gene sequence. It has been shown to be a 
useful indicator of the level of the protein product of a 
particular gene sequence present in a cell [46]. The gen- 
eral rule which applies is that the higher the value of the 
codon bias calculated for a gene, the more abundant the 
protein product of that gene becomes. The calculated 
codon bias values corresponding to the proteins identi- 
fied in Fig. 5 are shown in Fig. 6b. Nearly all of the pro- 
teins identified Q> 95%) have codon bias values of > 0.2, 
indicating they are highly abundant in cells. In contrast, 
codon bias values calculated for the entire yeast genome 
(Fig. 6a) show that the majority of proteins present in 
the proteome have a codon bias of < 0.2 and are thus of 
low abundance. 

This finding is of considerable importance in our assess- 
ment of the current status, of proteome analysis technol- 
ogy. It is clear that even using highly sensitive analytical 
techniques, we are only able to visualize and identify the 



more abundant proteins. Since many important regula- 
tory proteins are present only at low abundance, these 
would not be amenable to analysis using such tech- 
niques. This situation would be exacerbated in the anal- 
ysis of proteomes containing many more proteins than 
the approximately 6000 gene products present in yeast 
cells [16]. In the analysis of, for example, the proteome 
of any human cells, there are potentially 50000-100000 
gene products [47]. Inherent limitations on the amount 
of protein that can be loaded on 2-DE, and the number 
of components that can be resolved, indicate that only 
the most highly abundant fraction of the many gene 
products could be successfully analyzed. One approach 
that has been employed to circumvent these limitations 
is the use of very narrow range immobilized pH gradient 
strips for the first-dimension separation of 2-DE [48]. 
Since only those proteins which focus within the narrow 
range will enter the second dimension of separation, a 
much higher sample loading within the desired range is 
possible. This, in turn, can lead to the visualization and 
identification of less abundant proteins. 
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Figure 6. Calculated codon bias vaJues for yeast proteins. (A) Distribu- 
tion of calculated values for the entire yeast proteome. (B) Distribu- 
tion of calculated values for the subset of 80 identified proteins also 
shown in Figs. 1 and 5. Further details of experimental procedures are 
included in S. P. Gygi ef al (submitted).. 



4 Utility of proteome analysis for biological 
research 

For the success of proteomics as a. mainstream approach 
to the analysis of biological systems it is essential to 
define how proteome analysis and biological research 
projects intersect. Without a clear plan for the implemen- 
tation of proteome-type approaches into biological rer 
search projects the full impact of the technology can not 
be realized. The literature indicates that proteome anal- 
ysis is used both as a database/data archive, and as a bio- 
logical assay or biological research tool. 

4.1 The proteome as a database 

The use of proteomics as a database or data archive 
essentially entails an attempt to identify all the proteins 
in a cell or species and to annotate each protein with the 
known biological information that is relevant for each 
protein. The level of annotation can, of course, be exten- 
sive/ The most common implementation of this idea is 
the separation of proteins by high resolution 2-DE, the 
identification of each detected protein spot and the 
annotation of the protein spots in a 2-DE gel database 
format. This approach is complicated by the fact that it is 
difficult to precisely define a proteome and to decide 
which proteome should be represented in the database. 
In contrast to the genome of a species, which is essen- 
tially static, the proteome is highly dynamic. Processes 
such as differentiation, cell activation and disease can all 
significantly change the proteome of a species. This is 
illustrated in Fig. 7. The figure shows two high-resolu- 



tion 2-DE maps of proteins isolated from rat serum. 
Fig. 7A is from the serum of normal rats, while Fig. 7B 
is from the serum of rats in acute-phase serum after 
prior treatment with an inflammation-causing agent [49]. 
It is obvious that the protein patterns are significantly 
different in several areas, raising the question of exactly 
which proteome is being described. 

Therefore, a comprehensive proteome database of a spe- 
cies or cell type needs to contain all of the parameters 
which describe the state and the type of the cells from 
which the proteins were extracted as well as the software 
tools to search the database with queries which reflect 
the dynamics of biological systems. A comprehensive 
proteome database should be capable of quantitatively 
describing the fate of each protein if specific systems 
and pathways are activated in the cell. Specifically, the 
quantity, the degree of modification, the subcellular loca- 
tion and the nature of molecules specifically interacting 
with a protein as well as the rate of change of these 
variables should be described. Using these admittedly 
stringent criteria, there is currently no comlete proteome 
database. A number of such databases are, however, in 
the process of being constructed. The most advanced 
among them, in our opinion, are the yeast protein data- 
base YPD [50] (accessible at http://www.ypd.com) and 
the human 2D-PAGE databases of the Danish Centre 
for Human Genome Research [12] (accessible at http:// 
biobase.dk/cgi-bin/celis). While neither can be con- 
sidered complete as not all of the potential gene pro- 
ducts are identified, both contain extensive annotation 
of supplemental information for many of the spots 
which are positively identified in reference samples. 

4.2 The proteome as a biological assay 

The use of proteome analysis as a biological assay or 
research tool represents an alternative approach to inte- 
grating biology with proteomics. To investigate the state 
of a system, samples are subjected to a specific proceess 
that allows the quantitative or qualitative measurement 
of some of the variables which describe the system. In 
typical biochemical assays one variable (e.g., enzyme 
activity) of a single component (e.g., a particular en- 
zyme) is measured. Using proteomics as an assay mul- 
tiple variables (e.g., expression level, rate of synthesis, 
phosphorylation state, etc.) are measured concurrently 
on many (ideally all) of the proteins in a sample. The 
use of proteomics as an assay is a less far-reaching prop- 
osition than the construction of a comprehensive pro- 
teome database. It does, however, represent a pragmatic 
approach which can be adapted to investigate specific 
systems and pathways, as long as the interpretation of 
the results takes into account that with current technol- 
ogy not all of the variables which describe the system 
can be observed (see Section 3.4). 

A common implementation of proteome analysis as a 
biological assay is when a 2-DE protein pattern gener- 
ated from the analysis of an experimental sample is 
compared to an array of reference patterns representing 
different states of the system under investigation. The 
state of the experimental system at the time the sample 
was generated is therefore determined by the quantita-; 




# 



Electrophoresis 199B. 19, 1862-1871 



Proteome analysis: Biological assay or data archive 



1869 



rive comparative analysis of hundreds to a few thousand 
nroteins Comparative analysis of the 2-DE patterns fur- 
thermore highlights quantitative and qualitative differ- 
ences in the protein profiles which correlate with the 
state of the system. For this type of analysis it is not 
essential that all the proteins are identified or even visu- 



alized, although the results become more informative as 
more proteins are compared. It is obvious, however, that 
the possibility to identify any protein deemed character- 
istic for a particular state dramatically enhances this 
approach by opening up new avenues for experimenta- 
tion. 
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Figure 7. High resolution 2-DE map of proteins isolated from rat serum with or without prior exposure to an inflam- 
mation-causing agent. (A) normal rat serum, (B) acute-phase serum from rats which had previously been exposed to 
an inflammation-causing agent. The first dimension of separation is an IPG from pH 4-10, and the second dimen- 
sion is a 7.5-17.5%T gradient SDS-PAOE gel. Proteins were visualized by staining with amido black. Further details 
of experimental procedures are included in [14, 49). 
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Proteome analysis as a biological assay has been success- 
fully used in the field of toxicology, to characterize 
disease states or to study differential activation of cells. 
The approach is limited, of course, by the fact that only 
the visible protein spots are included in the assay, and it 
is well known that a substantial but far from complete 
fraction of cellular proteins are detected if a total cell 
lysate is separated by 2-DE. Proteins may not be 
detected in 2-DE gels because they are not abundant 
enough to be visualized by the detection method used, 
because they do not migrate within the boundaries (size, 
pi) resolved by the gel, because they are not soluble 
under the conditions used, or for other reasons. 

A different way to use proteome analysis as a biological 
assay to define the state of a biological system is to take 
advantage of the wealth of information contained in 
2-DB protein patterns. 2-DE is referred to as two-dimen- 
sional because of the electrophoretic mobility and the 
isoelectric points which define the position of each pro- 
tein in a 2-DE pattern. In addition to the two dimen- 
sions used to generate the protein patterns, a number of 
additional data dimensions are contained in the protein 
patterns. Some of these dimensions such as protein 
expression level, phosphorylation state, subcellular loca- 
tion, association with other proteins, rate of synthesis or 
degradation indicate the activity state of a protein or a 
biological system. Comparative analysis of 2-DE protein 
patterns representing different states is therefore ideally 
suited for the detection, identification and analysis of 
suitable markers. Once again it must be emphasized that 
in this type of experiment only a fraction of the cellular 
proteins is analyzed. Since many regulatory proteins are 
of low abundance, this limitation is a concern, particu- 
larly in cases in which regulatory pathways are being 
investigated. 

5 Concluding remarks 

In this report we have addressed three main issues 
related to proteome analysis. First, we have discussed 
the rationale for studying proteomes. Second, we have 
assessed the technical feasibility of analyzing proteomes 
and described current proteome technology, and third, 
•we have analyzed the utility of proteome analysis for bio- 
logical research. It is apparent that proteome analysis is 
an essential toot in the analysis of biological systems. 
The multi-level control of protein synthesis and degrada- 
tion in cells means that only the direct analysis of 
mature protein products can reveal their correct identi- 
ties, their relevant state of modification and/or associa- 
tion and their amounts. Recently developed methods 
have enabled the identification of proteins at ever- 
increasing sensitivity levels and at a high level of auto- 
mation of the analytical processes; A number of tech- 
nical challenges, however, remain. While it is currently 
possible to identify essentially any protein spots that can 
be visualized by common staining methods, it is ap- 
parent that without prior enrichment only a relatively 
small and highly selected population of long-lived, 
highly expressed proteins is observed. There are many 
more proteins in a given cell which are not visualised by 
such methods. Frequently it is the low abundance pro- 
teins that execute key regulatory functions. 
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We have outlined the two principal ways proteome anal- 
ysis is currently being used to intersect with biological 
research projects: the proteome as a database or data 
archive and proteome analysis as a biological assay. Both 
approaches have in common that at present they are con- 
ceptually and technically limited. Current proteome data- 
bases typically are limited to one cell type and one state 
of a cell and therefore do not account for the dynamics 
of biological systems. The use of proteome analysis as a 
biological assay can provide a wealth of information, but 
it is limited to the proteins detected and is therefore not 
truly proteome-wide. These limitations in proteomics are 
to a large extent a reflection of the fact that proteins in 
their fully processed form cannot easily be amplified and 
are therefore difficult to isolate in amounts sufficient for 
analysis or experimentation. The fact that to date no 
complete proteome has been described further attests to 
these difficulties. With continued rapid progress in pro- 
tein analysis technology, however, we anticipate that the 
goal of complete proteome analysis will eventually 
become attainable. 
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SUMMARY: These Guidelines will be used by USPTO personnel in their 
review of patent applications for compliance with the "written 
description*' requirement of 35 U.S.C. 112, para. 1. These Guidelines 
supersede the ""Revised Interim Guidelines for Examination of Patent 
Applications Under the 35 U.S.C. 112, para. 1 "Written Description' 
Requirement'' that were published in the Federal Register at 64 FR 
71427, Dec. 21, 1999, and in the Official Gazette at 1231 O.G. 123, 
Feb. 29, 2000. These Guidelines reflect the current understanding of 
the USPTO regarding the written description requirement of 35 U.S.C. 
112, para. 1, and are applicable to all technologies. 

DATES: The Guidelines are effective as of January 5, 2001. 

FOR FURTHER INFORMATION CONTACT: Stephen Walsh by telephone at (703) 
305-9035, by facsimile at (703) 305-9373, by mail to his attention 
addressed to United States Patent and Trademark Office, Box 8, 
Washington, DC 20231, or by electronic mail at 

" " Stephen . walsh@uspto . gov ' '; or Linda Therkorn by telephone at (703) 
305-8800, by facsimile at (703) 305-8825, by mail addressed to Box 
Comments, Commissioner for Patents, Washington, DC 20231, or by 
electronic mail at " " linda . therkorn@uspto . gov . ' ' 

SUPPLEMENTARY INFORMATION: As of the publication date of this notice, 
these Guidelines will be used by USPTO personnel in their review of 
patent applications for compliance with the ""written description' ' 
requirement of 35 U.S.C. 112, para. 1. Because these Guidelines only 
govern internal practices, they are exempt from notice and comment 
rulemaking under 5 U.S.C. 553 (b) (A) . 

Discussion of Public Comments 

Comments were received from 48 individuals and 18 organizations in 
response to the request for comments on the ""Revised Interim 
Guidelines for Examination of Patent Applications Under the 35 U.S.C. 
112, para. 1 "Written Description' Requirement'* published in the 
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Federal Register at 64 FR 71427, Dec. 21, 1999, and in the Official 
Gazette at 1231 O.G. 123, Feb. 29, 2000. The written comments have been 
carefully considered. 

Overview of Comments 

The majority of comments favored issuance of final written 
description guidelines with minor revisions. Comments pertaining to the 
written description guidelines are addressed in detail below. A few 
comments addressed particular concerns with respect to the associated 
examiner training materials that are available for public inspection at 
the USPTO web site ( www. uspto . gov ) . Such comments will be taken under 
advisement in the revision of the training materials; consequently, 
these comments are not specifically addressed below as they do not 
impact the content of the Guidelines. Several comments raised issues 
pertaining to the patentability of ESTs, genes, or genomic inventions 
with respect to subject matter eligibility (35 U.S.C. 101), novelty (35 
U.S.C. 102), or obviousness (35 U.S.C. 103). As these comments do not 
pertain to the written description requirement under 35 U.S.C. 112, 
they have not been addressed. However, the aforementioned comments are 
fully addressed in the "Discussion of Public Comments'' in the 
""Utility Examination Guidelines'* Final Notice, which will be 
published at or about the same time as the present Guidelines. 

Responses to Specific Comments 

(1) Comment: One comment stated that the Guidelines instruct the 
patent examiner to determine the correspondence between what applicant 
has described as the essential identifying characteristic features of 
the invention and what applicant has claimed, and that such analysis 
will lead to error. According to the comment, the examiner may decide 
what applicant should have claimed and reject the claim for failure to 
claim what the examiner considers to be the invention. Another comment 
suggested that the Guidelines should clarify what is meant by 
""essential features of the invention.'' Another comment suggested that 
what applicant has identified as the ""essential distinguishing 
characteristics'* of the invention should be understood in terms of 
Fiers v. Revel, 984 F.2d 1164, 1169, 25 USPQ2d 1601, 1605 (Fed. Cir. 
1993) (""Conception of a substance claimed per se without reference to 
a process requires conception of its structure, name, 

[ [Page 1100] ] 

formula, or definitive chemical or physical properties.' 1 ). 

Response: The suggestions have been adopted in part. The purpose of 
the written description analysis is to confirm that applicant had 
possession of what is claimed. The Guidelines have been modified to 
instruct the examiners to compare the scope of the invention claimed 
with the scope of what applicant has defined in the description of the 
invention. That is, the Guidelines instruct the examiner to look for 
consistency between a claim and what provides adequate factual support 
for the claim as judged by one of ordinary skill in the art from 
reading the corresponding written description. 

(2) Comment: Two comments urge that Regents of the University of 
California v. Eli Lilly & Co., 119 F.3d 1559, 43 USPQ2d 1398 (Fed. Cir. 
1997), is bad law and should not be followed by the USPTO because it 
conflicts with binding precedent, such as Vas-Cath v. Mahurkar, 935 
F.2d 1555, 19 USPQ2d 1111 (Fed. Cir. 1991) . Response: The final 
Guidelines are based on the Office's current understanding of the law 
and are believed to be fully consistent with binding precedent of the 
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U.S. Supreme Court and the U.S. Court of Appeals for the Federal 
Circuit. Eli Lilly is a precedential decision by the Court that has 
exclusive jurisdiction over appeals involving patent law. Accordingly, 
the USPTO must follow Eli Lilly. Furthermore, the USPTO does not view 
Eli Lilly as conflicting with Vas-Cath. Vas-Cath explains that the 
purpose of the written description requirement is to ensure that the 
applicant has conveyed to those of skill in the art that he or she was 
in possession of the claimed invention at the time of filing. Vas-Cath, 
935 F.2d at 1563-64, 19 USPQ2d at 1117. Eli Lilly explains that a 
chemical compound's name does not necessarily convey a written 
description of the named chemical compound, particularly when a genus 
of compounds is claimed. Eli Lilly, 119 F.3d at 1568, 43 USPQ2d at s 
1405. The name, if it does no more than distinguish the claimed genus 
from all others by function, does not satisfy the written description 
requirement because "it does not define any structural features 
commonly possessed by members of the genus that distinguish them from 
others. One skilled in the art therefore cannot, as one can do with a 
fully described genus, visualize or recognize the identity of the 
members of the genus.'' Eli Lilly, 119 F.3d at 1568, 43 USPQ2d at 1406. 
Thus, Eli Lilly identified a set of circumstances in which the words of 
the claim did not, without more, adequately convey to others that 
applicants had possession of what they claimed. 

(3) Comment: Several comments urged that the Guidelines do not 
recognize the inconsistency between the original claim doctrine and the 
written description requirement as set out in Fiers and Eli Lilly. On 
the other hand, another comment asserts that there is no strong 
presumption that an originally filed claim constitutes an adequate 
written description of the claimed subject matter. Several comments 
indicate that in haec verba support should be sufficient to comply with 
the written description requirement. Two comments urge that the concept 
of constructive reduction to practice upon filing of an application has 
been ignored. Response: As noted above, the USPTO does not find Fiers 
and Eli Lilly to be in conflict with binding precedent. An original 
claim may provide written description for itself, but it still must be 
an adequate written description which establishes that the inventor was 
in possession of the invention. The ""original claim doctrine' 1 is 
founded on cases which stand for the proposition that originally filed 
claims are part of the written description of an application as filed, 
and thus subject matter which is present only in originally filed 
claims need not find independent support in the specification. See, 
e.g., In re Roller, 613 F.2d 819, 824, 204 USPQ 702, 706 (CCPA 1980) 
(later added claims of similar scope and wording were adequately 
described by original claims); In re Gardner, 480 F.2d 879, 880, 178 
USPQ 149, 149 (CCPA 1973) (""Under these circumstances, we consider the 
original claim in itself adequate "written description' of the claimed 
invention. It was equally a "written description' * * * whether located 
among the original claims or in the descriptive part of the 
specif ication. '») . However, as noted in the preceding comment, Eli 
Lilly identified a set of circumstances in which the words of the claim 
did not, without more, adequately convey to others that applicants had 
possession of what they claimed. When the name of a novel chemical 
compound does not convey sufficient structural information about the 
compound to identify the compound, merely reciting the name is not 
enough to show that the inventor had possession of the compound at the 
time the name was written. The Guidelines indicate that there is a 
""strong presumption 1 ' that an adequate written description of the 
claimed invention is present when the application is filed, consistent 
with In re Wertheim, 541 F.2d 257, 263, 191 USPQ 90, 97 (CCPA 1976) 
(""we are of the opinion that the PTO has the initial burden of 
presenting evidence or reasons why persons skilled in the art would not 
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recognize in the disclosure a description of the invention defined by 
the claims.*'). In most cases, the statement that ""an originally filed 
claim is its own written description, ' 1 is borne out because the claim 
language conveys to others of skill in the art that the applicant was 
"in possession' 1 of what is claimed. The Guidelines emphasize that the 
burden of proof is on the examiner to establish that a description as 
filed is not adequate and require the examiner to introduce sufficient 
evidence or technical reasoning to shift the burden of going forward 
with contrary evidence to the applicant. 

(4) Comment: One comment stated that the Guidelines change the 
substance of the written description requirement to require some level 
of enablement. The comment stated that the Eli Lilly case should not be 
followed because its change in the quality of the description required 
is in conflict with precedent. Another comment suggested that to comply 
with the written description requirement, the description must both (i) 
demonstrate possession of the claimed invention by the applicant; and 
(ii) put the public in possession of the claimed invention. Response: 
As noted in the comment above, the USPTO is bound by the Federal 
Circuit's decision in Eli Lilly. The Guidelines have been revised to 
clarify that an applicant must provide a description of the claimed 
invention which shows that applicant was in possession of the claimed 
invention. The suggestion to emphasize that the written description 
requirement must put the public in possession of the invention has not 
been adopted because it removes much of the distinction between the 
written description requirement and the enablement requirement. 
Although the two concepts are entwined, they are distinct and each is 
evaluated under separate legal criteria. The written description 
requirement, a question of fact, ensures that the inventor conveys to 
others that he or she had possession of the claimed invention; whereas, 
the enablement requirement, a question of law, ensures that the 
inventor conveys to others how to make and use the claimed invention. 

(5) Comment: One comment suggested that the Guidelines should 
provide examples of situations in which the written description 
requirement was met but the enablement requirement was not, and vice 
versa. Another comment stated that examiners often use enablement 
language in making 
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written description rejections. Response: The enablement and written 
description requirements are not coextensive and, therefore, situations 
will arise in which one requirement is met but the other is not. 
Federal Circuit case law demonstrates many circumstances where 
enablement or written description issues, but not both, were before the 
Court. These Guidelines are intended to clarify for the examining corps 
the criteria needed to satisfy the written description requirement. For 
examples applying these Guidelines to hypothetical fact situations, see 
the "Synopsis of Application of Written Description Guidelines'' 
(examiner training materials available on-line at http : / / www . uspto . gov/ 
web/menu/written . pdf ) . These examples, as well as the examination form 
paragraphs and instructions on their proper use, provide the 
appropriate language examiners should use in making written description 
rejections. 

(6) Comment: One comment disagreed with the statement in an endnote 
that ""the fact that a great deal more than just a process is necessary 
to render a product invention obvious means that a great deal more than 
just a process is necessary to provide written description for a 
product invention. ' ' The comment indicated that the statement is overly 
broad and inconsistent with the ""strong presumption that an adequate 
written description of the claimed invention is present when the 
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application is filed. 11 As an extreme case, for example, for product - 
by-process claims, nothing else would be needed to provide the written 
description of the product. Response: The endnote has been clarified 
and is now more narrowly drawn. However, there is no per se rule that 
disclosure of a process is sufficient to adequately describe the 
products produced by the process. In fact, Fiers v. Revel and Eli Lilly 
involved special circumstances where the disclosure of a process of 
making and the function of the product alone did not provide an 
adequate written description for product claims. Even when a product is 
claimed in a product-by-process format, the adequacy of the written 
description of the process to support product claims must be evaluated- 
on a case-by-case basis. 

(7) Comment: Several comments urge that actual reduction to 
practice, as a method of satisfying the written description requirement 
by demonstrating possession, has been over -emphasized. Response: The 
Guidelines have been clarified to state that describing an actual 
reduction to practice is one of a number of ways to show possession of 
the invention. Description of an actual reduction to practice offers an 
important ""safe haven' ' that applies to all applications and is just 
one of several ways by which an applicant may demonstrate possession of 
the claimed invention. Actual reduction to practice may be crucial in 
the relatively rare instances where the level of knowledge and level of 
skill are such that those of skill in the art cannot describe a 
composition structurally, or specify a process of making a composition 
by naming components and combining steps, in such a way as to 
distinguish the composition with particularity from all others. Thus, 
the emphasis on actual reduction to practice is appropriate in those 
cases where the inventor cannot provide an adequate description of what 
the composition is, and a definition by function is insufficient to 
define a composition "because it is only an indication of what the 
[composition] does, rather than what it is.'' Eli Lilly, 119 F.3d at 
1568, 43 USPQ at 1406. See also Amgen Inc. v. Chugai Pharmaceutical 
Co., 927 F.2d 1200, 1206, 18 USPQ2d 1016, 1021 (Fed. Cir. 1991). 

(8) Comment: One comment asserts that the citation to Pfaff v. 
Wells Electronics, Inc., 525 U.S. 55, 48 USPQ2d 1641 (1998) is 
inappropriate and should be deleted because Pfaff is concerned with 
Sec. 102(b) on-sale bar, not written description. Another comment 
suggested that the Guidelines should provide an explanation of how the 
""ready for patenting' 1 concept of Pfaff should be used in determining 
compliance with the written description requirement. Response: The 
Guidelines state the general principle that actual reduction to 
practice is not required to show possession of, or to adequately 
describe, a claimed invention (although, as noted in the previous 
comment, an actual reduction to practice is crucial in relatively rare 
instances) . An alternative is to show that the invention described was 
""ready for patenting* 1 as set out in Pfaff. For example, a description 
of activities that demonstrates the invention was ""ready for 
patenting' 1 satisfies the written description requirement. As Wertheim 
indicates, ""how the specification accomplishes this is not material. 1 ' 
541 F.2d at 262, 191 USPQ at 96. 

(9) Comment: One comment stated that the written description of a 
claimed DNA should be required to include the complete sequence of the 
DNA and claims should be limited to the DNA sequence disclosed. 
Response: Describing the complete chemical structure, i.e., the DNA 
sequence, of a claimed DNA is one method of satisfying the written 
description requirement, but it is not the only method. See Eli Lilly, 
119 F.3d at 1566, 43 USPQ2d at 1404 (""An adequate written description 
of a DNA * * * requires a precise definition, such as by structure, 
formula, chemical name, or physical properties.' 1 (emphasis added, 
internal quote omitted)) . Therefore, there is no basis for a per se 
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rule requiring disclosure of complete DNA sequences or limiting DNA 
claims to only the sequence disclosed. 

(10) Comment: One comment stated that it is difficult to envision 
how one could provide a description of sufficient identifying 
characteristics of the invention without physical possession of a 
species of the invention, and thus this manner of showing possession 
should be considered as a way to show actual reduction to practice. 
Response: This suggestion has not been adopted. The three ways of 
demonstrating possession as set forth in the Guidelines are merely 
exemplary and are not mutually exclusive. While there are some cases 
where a description of sufficient relevant identifying characteristics 
will evidence an actual reduction to practice, there are other cases 
where it. will not. See, e.g., Ralston Purina Co. v. Far-Mar-Co, Inc., 
772 F.2d 1570, 1576, 227 USPQ 177, 180 (Fed. Cir. 1985) (disclosure 
taken with the knowledge of those skilled in the art may be sufficient 
support for claims) . 

(11) Comment: One comment stated that the Guidelines should.be 
revised to indicate that the test of disclosure of sufficiently 
detailed drawings should be expanded to include structural claiming of 
chemical entities. Response: The suggestion has been adopted. 

(12) Comment: One comment stated that the Guidelines should reflect 
that an inventor is in possession of the invention when the inventor 
demonstrably has at least a complete conception thereof, and that 
factors and attributes which provide proof of written description 
should include evidence typically provided to prove a complete 
conception. Response: The suggestion has not been adopted because the 
conception analysis typically involves documentary evidence in addition 
to the description of the invention in the application as filed. 
However, it is acknowledged that if evidence typically provided to 
prove a complete conception is present in the specification as filed, 
it would be sufficient to show possession. The Federal Circuit has 
stated "[t]he conception analysis necessarily turns on the inventor's 
ability to describe his invention with particularity. Until he can do 
so, he cannot prove possession 
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of the complete mental picture of the invention. 1 ' Burroughs Wellcome 
Co. v. Barr Labs., Inc., 40 F.3d 1223, 1228, 32 USPQ2d 1915, 1919 (Fed. 
Cir. 1994) . As further noted by the Federal Circuit, in order to prove 
conception, ""a party must show possession of every feature recited in 
the count, and that every limitation of the count must have been known 
to the inventor at the time of the alleged conception. 1 ' Coleman v. 
Dines, 754 F.2d 353, 359, 224 USPQ 857, 862 (Fed. Cir. 1985). 

(13) Comment: One comment indicated that a " "possession ' ■ test does 
not appear in Title 35 of the U.S. Code and is not clearly stated by 
the Federal Circuit. Therefore, it is recommended that patent examiners 
be directed to use existing judicial precedent to make rejections of 
claims unsupported by a statutory written description requirement. 
Response: While the Federal Circuit has not specifically laid out a 
" "possession ' ' test, the Court has clearly indicated that possession is 
a cornerstone of the written description inquiry. See, e.g., Vas-Cath, 
Inc. v. Mahurkar, 935 F.2d 1555, 1563, 19 USPQ2d 1111, 1116 (Fed. Cir. 
1991); see also Purdue Pharma L.P. v. Faulding Inc., 230 F.3d 1320, 
1323, 56 USPQ2d 1481, 1483 (Fed. Cir. 2000) ( vv [o]ne skilled in the 
art, reading the disclosure, must immediately discern the limitation at 
issue in the claims 1 ') (internal quote omitted). The possession test as 
set forth in the Guidelines is extrapolated from case law in a wide 
variety of technologies and is not intended to be limiting. Any 
rejections made by examiners will be made under 35 U.S.C. 112, para.l, 



http://fhvebgate.access.gpo.goW 11/14/2005 



Page 7 of 28 




with supporting rationale. Final rejections are appealable if applicant 
disagrees and follows the required procedures to appeal. 

(14) Comment: Two comments indicated that if the amino acid 
sequence for a polypeptide whose utility has been identified is 
described, then the question of possession of a class of nucleotides 
encoding that polypeptide can be addressed as a relatively routine 
matter using the understanding of the genetic code, and that the 
endnote addressing this issue should be revised. Response: The 
suggestion of these comments has been incorporated in the Guidelines 
and will be reflected in the training materials. However, based upon In 
re Bell, 991 F.2d 781, 785, 26 USPQ2d 1529, 1532 (Fed. Cir. 1993) and 
In re Baird, 16 F.3d 380, 382, 29 USPQ2d 1550, 1552 (Fed. Cir. 1994), 
this does not mean that applicant was in possession of any particular 
species of the broad genus. 

(15) Comment: One comment disagreed with an endnote which stated 
that a laundry list disclosure of moieties does not constitute a 
written description of every species in a genus. Specifically, the 
comment indicates that if the existence of a functional genus is 
adequately described in the specification, a laundry list of the 
species within that genus must satisfy the written description 
requirement. Response: The suggestion to revise the endnote will not be 
adopted. A lack of adequate written description problem arises if the 
knowledge and level of skill in the art would not permit one skilled in 
the art to immediately envisage the product claimed from the 
disclosure. This was aptly demonstrated in In re Bell and In re Baird 
where possession of a large genus did not put a person of ordinary 
skill in the art in possession of any particular species. See also 
Purdue Pharma, 230 F.3d at 1328, 56 USPQ2d at 1487 (because the 
original specification did not disclose the later claimed concentration 
ratio was a part of the invention, the inventors cannot argue that they 
are merely narrowing a broad invention) . 

(16) Comment: One comment suggested that in the majority of cases, 
a single species will support a generic claim, and that the Guidelines 
should emphasize this point. Response: The suggestion has been adopted 
to a limited degree. The Guidelines now indicate that a single species 
may, in some instances, provide an adequate written description of a 
generic claim when the description of the species would evidence to one 
of ordinary skill in the art that the invention includes the genus. 
Note, however, Tronzo v. Biomet, Inc., 156 F.3d 1154, 47 USPQ2d 1829 
(Fed. Cir. 1998), where the species in the parent application was held 
not to provide written description support for the genus in the child 
application. 

(17) Comment: One comment asserted that the Guidelines should focus 
on the compliance of the claims, not the specification, with the 
written description requirement. Response: This suggestion will not be 
adopted. "The specification shall contain a written description of the 
invention. ' ' 35 U.S.C. 112. The claims are part of the specification. 
Id., para. 2. If an adequate description is provided, it will suffice 
"whether located among the original claims or in the descriptive part 
of the specif ication. » ' In re Gardner, 480 F.2d 879, 880, 178 USPQ 149 
(CCPA 1973) . The entire disclosure, including the specification, 
drawings, and claims, must be considered. 

(18) Comment: One comment asserted that the Guidelines confuse 
"new matter, ' * 35 U.S.C. 132, with the written description 
requirement, and that the same standard for written description should 
be applied to both original claims and new or amended claims. Response: 
The Guidelines indicate that for both original and amended claims, the 
inquiry is whether one skilled in the art can reasonably conclude that 
the inventor had possession of the claimed invention at the time the 
application was filed. 
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(19) Comment: One comment suggested that the second paragraph of 
the section pertaining to determining what the claim as a whole covers 
should be deleted because it relates more to compliance with Sec. 112, 
second paragraph, than with the written description requirement. 
Response: This suggestion will not be adopted. The claims must be 
construed and all issues as to the scope and meaning of the claim must 
be explored during the inquiry into whether the written description 
requirement has been met. The concept of treating the claim as a whole 
is applicable to all criteria for patentability. 

(20) Comment: One comment suggested a different order for the 
general analysis for determining compliance with the written 
description requirement, starting with reading the claim, then the 
specification, and then determining whether the disclosure demonstrates 
possession by the applicant. Response: This suggestion will not be 
adopted. The claims must be construed as broadly as reasonable in light 
of the specification and the knowledge in the art. See In re Morris, 
127 F.3d 1048, 1054, 44 USPQ2d 1023, 1027 (Fed. Cir. 1997). Then the 
disclosure must be evaluated to determine whether it adequately 
describes the claimed invention, i.e., whether it conveys to a person 
having ordinary skill in the art that the applicant had possession of 
what he or she now claims. 

(21) Comment: Several comments suggested that the Guidelines are 
unclear with regard to how the examiner should treat the transitional 
phrase ""consisting essentially of.'' The comments also suggested that 
the endnote that explains "consisting essentially of' 1 does not make 
clear how the use of this intermediate transitional language affects 
the scope of the claim. Several comments stated that the USPT0 does not 
have legal authority to treat claims reciting this language as open 
(equivalent to ""comprising''). Another comment suggested that the 
phrase ""clear indication in the specification 1 ' be replaced with 
""explicit or implicit indication. ' ' Response: The transitional phrase 
""consisting essentially of*' ""excludes 
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ingredients that would "materially affect the basic and novel 
characteristics' of the claimed composition.*' Atlas Powder Co. v. E.I. 
DuPont de Nemours & Co., 750 F.2d 1569, 1574, 224 USPQ 409, 412 (Fed. 
Cir. 1984) . The basic and novel characteristics of the claimed 
invention are limited by the balance of the claim. In re Janakirama- 
Rao, 317 F.2d 951, 954, 137 USPQ 893, 896 (CCPA 1963). However, during 
prosecution claims must be read broadly, consistent with the 
specification. In re Morris, 127 F.3d 1048, 1054, 44 USPQ2d 1023, 1027 
(Fed. Cir. 1997) . Thus, for purposes of searching for and applying 
prior art in a rejection under 35 U.S.C. 102 or 103, if the 
specification or the claims do not define the ""basic and novel' * 
properties of the claimed subject matter (or if such properties are in 
dispute) , the broadest reasonable interpretation consistent with the 
specification is that the basic and novel characteristics are merely 
the presence of the recited limitations. See, e.g., Janakirama-Rao, 317 
F.2d at 954, 137 USPQ at 895-96. This does not indicate that the 
intermediate transitional language is never given weight. Applicants 
may amend the claims to avoid the rejections or seek to establish that 
the specification provides definitions of terms in the claims that 
define the basic and novel characteristics of the claimed invention 
which distinguish the claimed invention from the prior art. When an 
applicant contends that additional steps or materials in the prior art 
are excluded by the recitation of "consisting essentially of, ' 
applicant has the burden of showing that the introduction of additional 
steps or components would materially change the characteristics of 
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applicant's invention. In re De Lajarte, 337 F.2d 870, 143 USPQ 256 
(CCPA 1964) . The language used in the Guidelines is consistent with PPG 
Industries Inc. v. Guardian Industries Corp., 156 F.3d 1351, 1355, 48 
USPQ2d 1351, 1355 (Fed. Cir. 1998) (""PPG could have defined the scope 
of the phrase "consisting essentially of for purposes of its patent by 
making clear in its specification what it regarded as constituting a 
material change in the basic and novel characteristics .'') . 

(22) Comment:. One comment stated that the written description 
should ""disclose the invention, ' ' including why the invention works 
and how it was developed. Response: This suggestion has not been 
adopted. An inventor does not need to know how or why the invention 
works in order to obtain a patent. Newman v. Quigg, 877 F.2d 1575, 
1581, 11 USPQ2d 1340, 1345 (Fed. Cir. 1989). To satisfy the enablement 
requirement of 35 U.S.C. 112, para.l, an application must disclose the 
claimed invention in sufficient detail to enable a person of ordinary 
skill in the art to make and use the claimed invention. To satisfy the 
written description requirement of 35 U.S.C. 112, para. 1, the 
description must show that the applicant was in possession of the 
claimed invention at the time of filing. There is no statutory basis to 
require disclosure of why an invention works or how it was developed. 
""Patentability shall not be negatived by the manner in which the 
invention was made. 1 ' 35 U.S.C. 103(a). 

(23) Comment: One comment recommended that the phrases ""emerging 
and unpredictable technologies'' and ""unpredictable art' 1 be replaced 
with the phrase- -inventions characterized by factors which are not 
reasonably predictable in terms of the ordinary skill in the art--. 
Response: The suggestion is adopted in part and the recommended phrase 
has been added as an alternative. 

(24) Comment: One comment recommended that the phrase 
""conventional in the art 1 ' be replaced with--part of the knowledge of 
one of ordinary skill in the art--. Response: The suggestion is adopted 
in part and the recommended phrase has been added as an alternative. 
The standard of ""conventional in the art 1 * is supported by case law 
holding that a patent specification ""need not teach, and preferably 
omits, what is well known in the art. 1 ' See Spectra-Physics, Inc. v. 
Coherent, Inc., 827 F.2d 1524, 1534, 3 USPQ2d 1737, 1743 (Fed. Cir. 
1987); Hybritech Inc. v. Monoclonal Antibodies, Inc., 802 F.2d 1367, 
1384, 231 USPQ 81, 94 (Fed. Cir. 1986). See also Atmel Corp. v. 
Information Storage Devices, Inc., 198 F.3d 1374, 1382, 53 USPQ2d 1225, 
1231 (Fed. Cir. 1999) . 

(25) Comment: One comment recommended that the Guidelines be 
amended to state that the appropriate skill level for determining 
possession of the claimed invention is that of a person of ordinary 
skill in the art. Response: The comment has not been adopted. The 
statutory language itself indicates that compliance with the 
requirements of 35 U.S.C. 112, para.l, is judged from the standard of 
""any person skilled in the art. 1 ' It is noted, however, that the 
phrases ""one of skill in the art 11 and ""one of ordinary skill in the 
art' 1 appear to be synonymous. See, e.g., Union Oil Co. v. Atlantic 
Richfield Co., 208 F.3d 989, 997, 54 USPQ2d 1227, 1232 (Fed. Cir. 2000) 
(""The written description requirement does not require the applicant 
"to describe exactly the subject matter claimed, [instead] the 
description must clearly allow persons of ordinary skill in the art to 
recognize that [he or she] invented what is claimed.' Thus, Sec. 112, 
para. 1, ensures that, as of the filing date, the inventor conveyed 
with reasonable clarity to those of skill in the art that he was in 
possession of the subject matter of the claims. 1 ' (citations omitted, 
emphasis added) ) . 

(26) Comment: One comment stated that an endnote misstates the 
relevant law in stating that, to show inherent written descriptive 
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support for a claim limitation, the inherent disclosure must be such as 
would be recognized by a person of ordinary skill in the art. The 
comment recommended that the endnote be amended to delete the reference 
to recognition by persons of ordinary skill and to cite Pingree v. 
Hull, 518 F.2d 624, 186 USPQ 248 (CCPA 1975), rather than In re 
Robertson, 169 F.3d 743, 49 USPQ2d 1949 (Fed. Cir. 1999). Response: The 
comment has not been adopted. Federal Circuit precedent makes clear 
that an inherent disclosure must be recognized by those of ordinary 
skill in the art. See, e.g., Hyatt v. Boone, 146 F.3d 1348, 1354-55, 47 
USPQ2d 1128, 1132 (Fed. Cir. 1998) (""[T]he purpose of the description 
requirement is "to ensure that the inventor had possession, as of the 
filing date of the application relied on, of the specific subject 
matter later claimed by him. ' * * * Thus, the written description must 
include all of the limitations of the interference count, or the 
applicant must show that any absent text is necessarily comprehended in 
the description provided and would have been so understood at the time 
the patent application was filed. ' 1 (emphasis added)). See also Reiffin 
v. Microsoft Corp., 214 F.3d 1342, 1346, 54 USPQ2d 1915, 1917 (Fed. 
Cir. 2000) (The ""application considered as a whole must convey to one 
of ordinary skill in the art, either explicitly or inherently, that 
[the inventor] invented the subject matter claimed * * *. See * * * 
Continental Can Co. USA v. Monsanto Co., 948 F.2d 1264, 1268, 20 USPQ2d 
1746, 1749 (Fed. Cir. 1991) (descriptive matter may be inherently 
present in a specification if one skilled in the art would necessarily 
recognize such a disclosure) * 1 ) . 

(27) Comment: Several comments pointed out an inconsistency in the 
Federal Register Notice re: the Revised Interim Written Description 
Guidelines. The inconsistency concerned the treatment of claims 
directed to an isolated DNA comprising SEQ ID NO:l wherein SEQ ID NO : 1 
is an expressed sequence tag. The comments contrasted paragraphs 34 and 
3 5 of the Response to 
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Public Comments with the statement in the text of the Guidelines that a 
genus must be supported by a representative number of species (as 
analyzed in Example 7 of the training materials). Response: The USPTO 
acknowledges that there was an inconsistency. The Office notes that a 
claim reciting a nucleic acid comprising SEQ ID N0:1 may be subject to 
a rejection for lack of an adequate written description where 
particular identifiable species within the scope of the claim lack an 
adequate written description. The training materials as amended 
exemplify an appropriate analysis. 

(2 8) Comment: One comment stated that the USPTO should respond to 
the issue of whether the U.S. is meeting its TRIPs obligations. This 
comment noted that the USPTO did not address an earlier comment 
regarding the ""Interim Guidelines for the Examination of Patent 
Applications under the 35 U.S.C. 112, para. 1, "Written Description' 
Requirement, ' ' 63 FR 32,639, June 15, 1998, which questioned whether 
the written description requirement is truly different from the 
enablement requirement, and indicated that such a requirement may be 
contrary to the TRIPs provisions of the World Trade Organization 
(Article 27.1). Article 27.1 requires WTO Members to, inter alia, make 
patents available, with limited exceptions, for products and processes 
in all fields of technology so long as those products and processes are 
new, involve an inventive step, and are capable of industrial 
application. The comment further suggested a response. Response: TRIPs 
Article 27 does not address what must be included in a patent 
application to allow WTO Member officials to determine whether 
particular inventions meet the standards for patentability established 
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in that Article. TRIPs Article 29, which is more relevant to this 
comment, states that Members "shall require 1 ' patent applicants to 
disclose their invention "in a manner sufficiently clear and complete 
for the invention to be carried out by a person skilled in the art. 1 ' 
If the written description is not clear and complete, the applicant may 
not have been in possession of the invention. This may support both 
written description and enablement standards. In addition, Article 29 
expressly authorizes Members to require patent applicants to disclose 
the best method the inventor knows at the time of filing an application 
for carrying out the invention. 

(29) Comment: Two comments commended the USPTO for eliminating the 
Biotechnology Specific Examples in the Revised Interim Written 
Description Guidelines and providing separate training materials. One 
comment indicated a need to reconfirm the examples set forth in the 
Interim Written Description Guidelines published in 1998. Response: The 
current training materials reflect the manner in which the USPTO 
interprets the Written Description Guidelines. 

(30) Comment: Several comments addressed specific concerns about 
the examiner training materials. Response: The comments received with 
respect to the training materials will be taken under advisement as the 
Office revises the training materials in view of the revisions to the 
Guidelines. The specific comments will not be addressed herein as they 
do not impact the language of the Guidelines. 

Guidelines for the Examination of Patent Applications Under the 35 
U.S.C. 112, para. 1, ""Written Description' ' Requirement 

These "Written Description Guidelines'* are intended to assist 
Office personnel in the examination of patent applications for 
compliance with the written description requirement of 35 U.S.C. 112, 
para. 1. This revision is based on the Office's current understanding 
of the law and public comments received in response to the USPTO 's 
previous request for public comments on its Revised Interim Written 
Description Guidelines and is believed to be fully consistent with 
binding precedent of the U.S. Supreme Court, as well as the U.S. Court 
of Appeals for the Federal Circuit and its predecessor courts. 

This revision does not constitute substantive rulemaking and hence 
does not have the force and effect of law. It is designed to assist 
Office personnel in analyzing claimed subject matter for compliance 
with substantive law. Rejections will be based upon the substantive 
law,, and it is these rejections which are appealable. Consequently, any 
perceived failure by Office personnel to follow these Guidelines is 
neither appeaiable nor petitionable . 

These Guidelines are intended to form part of the normal 
examination process. Thus, where Office personnel establish a prima 
facie case of lack of written description for a claim, a thorough 
review of the prior art and examination on the merits for compliance 
with the other statutory requirements, including those of 35 U.S.C. 
101, 102, 103, and 112, is to be conducted prior to completing an 
Office action which includes a rejection for lack of written 
description. Office personnel are to rely on this revision of the 
Guidelines in the event of any inconsistent treatment of issues 
involving the written description requirement between these Guidelines 
and any earlier guidance provided from the Office. 

I. General Principles Governing Compliance With the ""Written 
Description' ' Requirement for Applications 

The first paragraph of 3 5 U.S.C. 112 requires that the 
""specification shall contain a written description of the invention * 
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* *.'» This requirement is separate and distinct from the enablement 
requirement . \1\ The written description requirement has several policy 
objectives. * ** [T] he "essential goal 1 of the description of the 
invention requirement is to clearly convey the information that an 
applicant has invented the subject matter which is claimed. ' * \2\ 
Another objective is to put the public in possession of what the 
applicant claims as the invention . \3\ The written description 
requirement of the Patent Act promotes the progress of the useful arts 
by ensuring that patentees adequately describe their inventions in 
their patent specifications in exchange for the right to exclude others 
from practicing the invention for the duration of the patent's term. 

To satisfy the written description requirement, a patent 
specification must describe the claimed invention in sufficient detail . 
that one skilled in the art can reasonably conclude that the inventor 
had possession of the claimed invention . \4\ An applicant shows 
possession of the claimed invention by describing the claimed invention 
with all of its limitations using such descriptive means as words, 
structures, figures, diagrams, and formulas that fully set forth the 
claimed invention. \5\ Possession may be shown in a variety of ways 
including description of an actual reduction to practice, \6\ or by 
showing that the invention was "ready for patenting' ' such as by the 
disclosure of drawings or structural chemical formulas that show that 
the invention was complete, \7\ or by describing distinguishing 
identifying characteristics sufficient to show that the applicant was 
in possession of the claimed invention. \8\ A question as to whether a 
specification provides an adequate written description may arise in the 
context of an original claim which is not described sufficiently, a new 
or amended claim wherein a claim limitation has been added or removed, 
or a claim to entitlement of an earlier priority date or effective 
filing date under 35 U.S.C. 119, 120, or 365(c). \9\ Compliance with the 
written description requirement is a question of 

[ [Page 1105] ] 

fact which must be resolved on a case-by-case basis. \10\ 

A. Original Claims 

There is a strong presumption that an adequate written description 
of the claimed invention is present when the application is filed. \11\ 
However, the issue of a lack of adequate written description may arise 
even for an original claim when an aspect of the claimed invention has 
not been described with sufficient particularity such that one skilled 
in the art would recognize that the applicant had possession of the 
claimed invention . \ 12 \ The claimed invention as a whole may not be 
adequately described if the claims require an essential or critical 
feature which is not adequately described in the specification and 
which is not conventional in the art or known to one of ordinary skill 
in the art.\13\ This problem may arise where an invention is described 
solely in terms of a method of its making coupled with its function and 
there is no described or art -recognized correlation or relationship 
between the structure of the invention and its function. \14\ A lack of 
adequate written description issue also arises if the knowledge and 
level of skill in the art would not permit one skilled in the art to 
immediately envisage the product claimed from the disclosed 
process . \15\ 

B. New or Amended Claims 

The proscription against the introduction of new matter in a patent 
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application \16\ serves to prevent an applicant from adding information 
that goes beyond the subject matter originally filed. \17\ Thus, the 
written description requirement prevents an applicant from claiming 
subject matter that was not adequately described in the specification 
as filed. New or amended claims which introduce elements or limitations 
which are not supported by the as -filed disclosure violate the written 
description requirement . \18\ While there is no in haec verba 
requirement, newly added claim limitations must be supported in the 
specification through express, implicit, or inherent disclosure. An 
amendment to correct an obvious error does not constitute new matter 
where one skilled in the art would not only recognize the existence of 
the error in the specification, but also recognize the appropriate 
correction. \19\ Deposits made after the application filing date cannot 
be relied upon to support additions to or correction of information in 
the application as filed. \20\ 

Under certain circumstances, omission of a limitation can raise an 
issue regarding whether the inventor had possession of a broader, more 
generic invention. \21\ A claim that omits an element which applicant 
describes as an essential or critical feature of the invention 
originally disclosed does not comply with the written description 
requirement . \22\ 

The fundamental factual inquiry is whether the specification 
conveys with reasonable clarity to those skilled in the art that, as of 
the filing date sought, applicant was in possession of the invention as 
now claimed. \23\ 

II. Methodology for Determining Adequacy of Written Description 

A. Read and Analyze the Specification for Compliance With 35 U.S.C. 
112, para. 1 

Office personnel should adhere to the following procedures when 
reviewing patent applications for compliance with the written 
description requirement of 35 U.S.C. 112, para. 1. The examiner has the 
initial burden, after a thorough reading and evaluation of the content 
of the application, of presenting evidence or reasons why a person 
skilled in the art would not recognize that the written description of 
the invention provides support for the claims. There is a strong 
presumption that an adequate written description of the claimed 
invention is present in the specification as filed; \24\ however, with 
respect to newly added or amended claims, applicant should show support 
in the original disclosure for the new or amended claims. \25\ 
Consequently, rejection of an original claim for lack of written 
description should be rare. The inquiry into whether the description 
requirement is met is a question of fact that must be determined on a 
case-by-case basis. \26\ 

1. For Each Claim, Determine What the Claim as a Whole Covers 

Claim construction is an essential part of the examination process. 
Each claim must be separately analyzed and given its broadest 
reasonable interpretation in light of and consistent with the written 
description. \27\ The entire claim must be considered, including the 
preamble language \28\ and the transitional phrase. \29\ The claim as a 
whole, including all limitations found in the preamble , \30\ the 
transitional phrase, and the body of the claim, must be sufficiently 
supported to satisfy the written description requirement . \31\ 

The examiner should evaluate each claim to determine if sufficient 
structures, acts, or functions are recited to make clear the scope and 
meaning of the claim, including the weight to be given the 
preamble . \32\ The absence of definitions or details for well- 
established terms or procedures should not be the basis of a rejection 
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under 35 U.S.C. 112, para. 1, for lack of adequate written description. 
Limitations may not, however, be imported into the claims from the 
specification. 

2. Review the Entire Application to Understand How Applicant Provides 
Support for the Claimed Invention Including Each Element and/or Step 

Prior to determining whether the disclosure satisfies the written 
description requirement for the claimed subject matter, the examiner 
should review the claims and the entire specification, including the 
specific embodiments, figures, and sequence listings, to understand how 
applicant provides support for the various features of the claimed 
invention. \3 3 \ The analysis of whether the specification complies with 
the written description requirement calls for the examiner to compare 
the scope of the claim with the scope of the description to determine 
whether applicant has demonstrated possession of the claimed invention. 
Such a review is conducted from the standpoint of one of skill in the 
art at the time the application was filed \34\ and should include a 
determination of the field of the invention and the level of skill and 
knowledge in the art. Generally, there is an inverse correlation 
between the level of skill and knowledge in the art and the specificity 
of disclosure necessary to satisfy the written description requirement. 
Information which is well known in the art need not be described in 
detail in the specif ication. \35\ 

3. Determine Whether There is Sufficient Written Description to Inform 
a Skilled Artisan That Applicant was in Possession of the Claimed 
Invention as a Whole at the Time the Application Was Filed 

a. Original claims. Possession may be shown in many ways. For 
example, possession may be shown, inter alia, by describing an actual 
reduction to practice of the claimed invention. Possession may also be 
shown by a clear depiction of the invention in detailed drawings or in 
structural chemical formulas which permit a person skilled in the art 
to clearly recognize that applicant had possession of the claimed 
invention. An adequate written description of the invention may be 
shown by any description of sufficient, relevant, identifying 
characteristics so long as a person skilled in the art would recognize 
that the inventor had possession of the claimed invention. \36\ 

A specification may describe an actual reduction to practice by 
showing 

[ [Page 1106] ] 

that the inventor constructed an embodiment or performed a. process that 
met all the limitations of the claim and determined that the invention 
would work for its intended purpose. \37\ Description of an actual 
reduction to practice of a biological material may be shown by 
specifically describing a deposit made in accordance with the 
requirements of 37 CFR 1.801 et seq.\38\ 

An applicant may show possession of an invention by disclosure of 
drawings \39\ or structural chemical formulas\40\ that are sufficiently 
detailed to show that applicant was in possession of the claimed 
invention as a whole. The description need only describe in detail that 
which is new or not conventional . \41\ This is equally true whether the 
claimed invention is directed to a product or a process. 

An applicant may also show that an invention is complete by 
disclosure of sufficiently detailed, relevant identifying 
characteristics \42\ which provide evidence that applicant was in 
possession of the claimed invention, \4 3 \ i.e., complete or partial 
structure, other physical and/or chemical properties, functional 
characteristics when coupled with a known or disclosed correlation 
between function and structure, or some combination of such 
characteristics . \44\ What is conventional or well known to one of 
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ordinary skill in the art need not be disclosed in detail. \45\ If a 
skilled artisan would have understood the inventor to be in possession 
of the claimed invention at the time of filing, even if every nuance of 
the claims is not explicitly described in the specification, then the 
adequate description requirement is met.\46\ 

(1) For each claim drawn to a single embodiment or species: \47\ 

(a) Determine whether the application describes an actual reduction 
to practice of the claimed invention. 

(b) If the application does not describe an actual reduction to 
practice, determine whether the invention is complete as evidenced by a 
reduction to drawings or structural chemical formulas that are 
sufficiently detailed to show that applicant was in possession of the 
claimed invention as a whole. 

(c) If the application does not describe an actual reduction to 
practice or reduction to drawings or structural chemical formula as 
discussed above, determine whether the invention has been set forth in 
terms of distinguishing identifying characteristics as evidenced by 
other descriptions of the invention that are sufficiently detailed to 
show that applicant was in possession of the claimed invention. 

(i) Determine whether the application as filed describes the 
complete structure (or acts of a process) of the claimed invention as a 
whole. The complete structure of a species or embodiment typically 
satisfies the requirement that the description be set forth vv in such 
full, clear, concise, and exact terms'' to show possession of the 
claimed invention . \48\ If a complete structure is disclosed, the 
written description requirement is satisfied for that species or 
embodiment, and a rejection under 35 U.S.C. 112, para. 1, for lack of 
written description must not be made. 

(ii) If the application as filed does not disclose the complete 
structure (or acts of a process) of the claimed invention as a whole, 
determine whether the specification discloses other relevant 
identifying characteristics sufficient to describe the claimed 
invention in such full, clear, concise, and exact terms that a skilled 
artisan would recognize applicant was in ' possession of the claimed 
invention. \4 9\ 

Whether the specification shows that applicant was in possession of 
the claimed invention is not a single, simple determination, but rather 
is a factual determination reached by considering a number of factors. 
Factors to be considered in determining whether there is sufficient 
evidence of possession include the level of skill and knowledge in the 
art, partial structure, physical and/or chemical properties, functional 
characteristics alone or coupled with a known or disclosed correlation 
between structure and function, and the method of making the claimed 
invention. Disclosure of any combination of such identifying 
characteristics that distinguish the claimed invention from other 
materials and would lead one of skill in the art to the conclusion that 
the applicant was in possession of the claimed species is 
suf f icient . \50\ Patents and printed publications in the art should be 
relied upon to determine whether an art is mature and what the level of 
knowledge and skill is in the art. In most technologies which are 
mature, and wherein the knowledge and level of skill in the art is 
high, a written description question should not be raised for original 
claims even if the specification discloses only a method of making the 
invention and the function of the invention. \ 5 1\ In contrast, for 
inventions in emerging and unpredictable technologies, or for 
inventions characterized by factors not reasonably predictable which 
are known to one of ordinary skill in the art, more evidence is 
required to show possession. For example, disclosure of only a method 
of making the invention and the function may not be sufficient to 
support a product claim other than a product-by-process claim. \52\ 
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Furthermore, disclosure of a partial structure without additional 
characterization of the product may not be sufficient to evidence 
possession of the claimed invention . \5 3 \ 

Any claim to a species that does not meet the test described under 
at least one of (a) , (b) , or (c) must be rejected as lacking adequate 
written description under 35 U.S.C. 112, para. 1. 

(2) For each claim drawn to a genus: 

The written description requirement for a claimed genus may be 
satisfied through sufficient description of a representative number of 
species by actual reduction to practice (see (1) (a) , above) , reduction 
to drawings (see (1) (b) , above) , or by disclosure of relevant, 
identifying characteristics, i.e., structure or other physical and/or 
chemical properties, by functional characteristics coupled with a known 
or disclosed correlation between function and structure, or by a 
combination of such identifying characteristics, sufficient to show the 
applicant was in possession of the claimed genus (see (1) (c) , 
above) . \54\ 

A ""representative number of species'* means that the species which 
are adequately described are representative of the entire genus. Thus, 
when there is substantial variation within the genus, one must describe 
a sufficient variety of species to reflect the variation within the 
genus. On the other hand, there may be situations where one species 
adequately supports a genus. \55\ What constitutes a ""representative 
number 1 1 is an inverse function of the skill and knowledge in the art. 
Satisfactory disclosure of a ""representative number' » depends on 
whether one of skill in the art would recognize that the applicant was 
in possession of the necessary common attributes or features of the 
elements possessed by the members of the genus in view of the species 
disclosed. For inventions in an unpredictable art, adequate written 
description of a genus which embraces widely variant species cannot be 
achieved by disclosing only one species within the genus. \56\ 
Description of a representative number of species does not require the 
description to be of such specificity that it would provide individual 
support for each species that the genus embraces . \57\ If a 
representative number of adequately described species are not disclosed 
for a genus, the claim to that genus must be rejected as lacking 
adequate written description under 3 5 U.S.C. 112, para. 1. 

b. New claims, amended claims, or claims asserting entitlement to 
the benefit of an earlier priority date or filing date under 35 U.S.C. 
119, 120, or 

[ [Page 1107] ] 

365(c). The examiner has the initial burden of presenting evidence or 
reasoning to explain why persons skilled in the art would not recognize 
in the original disclosure a description of the invention defined by 
the claims. \58\ However, when filing an amendment an applicant should 
show support in the original disclosure for new or amended claims. \59\ 
To comply with the written description requirement of 35 U.S.C. 112, 
para. 1, or to be entitled to an earlier priority date or filing date 
under 35 U.S.C. 119, 120, or 365(c), each claim limitation must be 
expressly, \60\ implicitly , \61\ or inherently \62\ supported in the 
originally filed disclosure . \63\ Furthermore, each claim must include 
all elements which applicant has described as essential . \64\ 

If the originally filed disclosure does not provide support for 
each claim limitation, or if an element which applicant describes as 
essential or critical is not claimed, a new or amended claim must be 
rejected under 35 U.S.C. 112, para. 1, as lacking adequate written 
description, or in the case of a claim for priority under 35 U.S.C. 
119, 120, or 365(c), the claim for priority must be denied. 
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111. Complete Patentability Determination Under All Statutory- 
Requirements and Clearly Communicate Findings, Conclusions, and 
Their Bases 

The above only describes how to determine whether the written 
description requirement of 35 U.S.C. 112, para. 1, is satisfied. 
Regardless of the outcome of that determination, Office personnel must 
complete the patentability determination under all the relevant 
statutory provisions of title 35 of the U.S. Code. 

Once Office personnel have concluded analysis of the claimed 
invention under all the statutory provisions, including 35 U.S.C. 101, 

112, 102, and 103, they should review all the proposed rejections and 
their bases to confirm their correctness. Only then should any 
rejection be imposed in an Office action. The Office action should 
clearly communicate the findings, conclusions, and reasons which 
support them. When possible, the Office action should offer helpful 
suggestions on how to overcome rejections. 

A. For Each Claim Lacking Written Description Support, Reject the Claim 
Under Section 112, para. 1, for Lack of Adequate Written Description 

A description as filed is presumed to be adequate, unless or until 
sufficient evidence or reasoning to the contrary has been presented by 
the examiner to rebut the presumption. \65\ The examiner, therefore, 
must have a reasonable basis to challenge the adequacy of the written 
description. The examiner has the initial burden of presenting by a 
preponderance of evidence why a person skilled in the art would not 
recognize in an applicant's disclosure a description of the invention 
defined by the claims. \66\ In rejecting a claim, the examiner must set 
forth express findings of fact regarding the above analysis which 
support the lack of written description conclusion. These findings 
should : 

(1) Identify the claim limitation at issue; and 

(2) Establish a prima facie case by providing reasons why a person 
skilled in the art at the time the application was filed would not have 
recognized that the inventor was in possession of the invention as 
claimed in view of the disclosure of the application as filed. A 
general allegation of ""unpredictability in the art' 1 is not a 
sufficient reason to support a rejection for lack of adequate written 
description. 

When appropriate, suggest amendments to the claims which can be 
supported by the application's written description, being mindful of 
the prohibition against the addition of new matter in the claims or 
description. \67\ 

B. Upon Reply by Applicant, Again Determine the Patentability of the 
Claimed Invention, Including Whether the Written Description 
Requirement Is Satisfied by Reperforming the Analysis Described Above 
in View of the Whole Record 

Upon reply by applicant, before repeating any rejection under 35 
U.S.C. 112, para. 1, for lack of written description, review the basis 
for the rejection in view of the record as a whole, including 
amendments, arguments, and any evidence submitted by applicant. If the 
whole record now demonstrates that the written description requirement 
is satisfied, do not repeat the rejection in the next Office action. If 
the record still does not demonstrate that the written description is 
adequate to support the claim ( s) , repeat the rejection under 3 5 U.S.C. 
112, para. 1, fully respond to applicant's rebuttal arguments, and 
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properly treat any further showings submitted by applicant in the 
reply. When a rejection is maintained, any affidavits relevant to the 
112, para. 1, written description requirement , \68\ must be thoroughly 
analyzed and discussed in the next Office action. 

Dated: December 29, 2000. 
Q. Todd Dickinson, 

Under Secretary of Commerce for Intellectual Property and Director of 
the United States Patent and Trademark Office. 

Endnotes 

\1\ see, e.g., Vas-Cath, Inc. v. Mahurkar, 935 F.2d 1555, 1560, 
19 USPQ2d 1111, 1114 (Fed. Cir. 1991). 

\2\ In re Barker, 559 F.2d 588, 592 n.4, 194 USPQ 470, 473 n.4 
(CCPA 1977) . 

\3\ See Regents of the University of California v. Eli Lilly, 
119 F.3d 1559, 1566, 43 USPQ2d 1398, 1404 (Fed. Cir. 1997), cert, 
denied, 523 U.S. 1089 (1998). 

\4\ See, e.g., Vas-Cath, Inc. v. Mahurkar, 935 F.2d at 1563, 19 
USPQ2d at 1116. Much of the written description case law addresses 
whether the specification as originally filed supports claims not 
originally in the application. The issue raised in the cases is most 
often phrased as whether the original application provides 
""adequate support 1 ' for the claims at issue or whether the material 
added to the specification incorporates ""new matter' 1 in violation 
of 35 U.S.C. 132. The ""written description'* question similarly 
arises in the interference context, where the issue is whether the 
specification of one party to the interference can support the newly 
added claims corresponding to the count at issue, i.e., whether that 
party can ""make the claim'* corresponding to the interference 
count. See, e.g., Martin v. Mayer, 823 F.2d 500, 503, 3 USPQ2d 1333, 
1335 (Fed. Cir. 1987) . 

In addition, early opinions suggest the Patent and Trademark 
Office was unwilling to find written descriptive support when the 
only description was found in the claims; however, this viewpoint 
was rejected. See In re Koller, 613 F.2d 819, 204 USPQ 702 (CCPA 
1980) (original claims constitute their own description); accord In 
re Gardner, 475 F.2d 1389, 177 USPQ 396 (CCPA 1973); accord In re 
Wertheim, 541 F. 2d 257, 191 USPQ 90 (CCPA 1976) (accord). It is now 
well accepted that a satisfactory description may be in the claims 
or "any other portion of the originally filed specification. These 
early opinions did not address the quality or specificity of 
particularity that was required in the description, i.e., how much 
description is enough. 

\5\ Lockwood v. American Airlines, Inc., 107 F.3d 1565, 1572, 41 
USPQ2d 1961, 1966 (Fed. Cir. 1997) . 

\6\ An application specification may show actual reduction to 
practice by describing testing of the claimed invention or, in the 
case of biological materials, by specifically describing a deposit 
made in accordance with 37 CFR 1.801 et seq. See also Deposit of 
Biological Materials for Patent Purposes, Final Rule, 54 FR 34,864 
(August 22, 1989) (""The requirement for a specific identification 
is consistent with the description requirement of the first 
paragraph of 35 U.S.C. 112, and to provide an antecedent basis for 
the biological material which either has been or will be deposited 
before the patent is granted. 1 ' Id. at 34,876. ""The description 
must be sufficient to permit verification that the deposited 
biological material is in fact that disclosed. Once the 
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patent issues, the description must be sufficient to aid in the 
resolution of questions of infringement. 11 Id. at 34,880.). Such a 
deposit is not a substitute for a written description of the claimed 
invention. The written description of the deposited material needs 
to be as complete as possible because the examination for 
patentability proceeds solely on the basis of the written 
description. See, e.g., In re Lundak, 773 F.2d 1216, 227 USPQ 90 
(Fed. Cir. 1985). See also 54 FR at 34,880 ("As a general rule, the 
more information that is provided about a particular deposited 
biological material, the better the examiner will be able to compare 
the identity and characteristics of the deposited biological 
material with the prior art. ' ') . 

\7\ Pfaff v. Wells Electronics, Inc., 525 U.S. 55, 68, 119 S.Ct. 
304, 312, 48 USPQ2d 1641, 1647 (1998); Eli Lilly, 119 F.3d at 1568, 
43 USPQ2d at 1406. 

\8\ See Amgen, Inc. v. Chugai Pharmaceutical, 927 F.2d 1200, 
1206, 18 USPQ2d 1016, 1021 (Fed. Cir. 1991) (one must define a 
compound by "whatever characteristics sufficiently distinguish 
it ' ■ ) . 

\9\ A description requirement issue can arise for original 
claims (see, e.g., Eli Lilly, 119 F.3d 1559, 43 USPQ2d 1398) as well 
as new or amended claims. Most typically, the issue will arise in 
the context of determining whether new or amended claims are 
supported by the description of the invention in the application as 
filed (see, e.g., In re Wright, 866 F.2d 422, 9 USPQ2d 1649 (Fed. 
Cir. 1989)), whether a claimed invention is entitled to the benefit 
of an earlier priority date or effective filing date under 35 U.S.C. 
119, 120, or 365(c) (see, e.g., Tronzo v. Biomet, Inc., 156 F.3d 
1154, 47 USPQ2d 1829 (Fed. Cir. 1998); Fiers v. Revel, 984 F.2d 
1164, 25 USPQ2d 1601 (Fed. Cir. 1993); In re Ziegler, 992 F.2d 1197, 
1200, 26 USPQ2d 1600, 1603 (Fed. Cir. 1993)), or whether a 
specification provides support for a claim corresponding to a count 
in an interference (see, e.g., Fields v. Conover, 443 F.2d 1386, 170 
USPQ 276 (CCPA 1971) ) . 

\10\ Vas-Cath, Inc. v. Mahurkar, 935 F.2d at 1563, 19 USPQ2d at 
1116 (Fed. Cir. 1991) . 

\11\ In re Wertheim, 541 F.2d 257, 263, 191 USPQ 90, 97 (CCPA 
1976) (""we are of the opinion that the PT0 has the initial burden 
of presenting evidence or reasons why persons skilled in the art 
would not recognize in the disclosure a description of the invention 
defined by the claims' ! ) . 

\12\ See endnote 4. 

\13\ For example, consider the claim "A gene comprising SEQ ID 
N0:1. I! A determination of what the claim as a whole covers may 
result in a conclusion that specific structures such as a promoter, 
a coding region, or other elements are included. Although all genes 
encompassed by this claim share the characteristic of comprising SEQ 
ID NO:l, there may be insufficient description of those specific 
structures (e.g., promoters, enhancers, coding regions, and other 
regulatory elements) which are also included. 

\14\ A biomolecule sequence described only by a functional 
characteristic, without any known or disclosed correlation between 
that function and the structure of the sequence, normally is not a 
sufficient identifying characteristic for written description 
purposes, even when accompanied by a method of obtaining the claimed 
sequence. For example, even though a genetic code table would 
correlate a known amino acid sequence with a genus of coding nucleic 
acids, the same table cannot predict the native, naturally occurring 
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nucleic acid sequence of a naturally occurring mRNA or its 
corresponding cDNA. Cf . In re Bell, 991 F.2d 781, 26 USPQ2d 1529 
(Fed. Cir. 1993), and In re Deuel, 51 F.3d 1552, 34 USPQ2d 1210 
(Fed. Cir. 1995) (holding that a process could not render the 
product of that process obvious under 35 U.S.C. 103) . The Federal 
Circuit has pointed out that under United States law, a description 
that does not render a claimed invention obvious cannot sufficiently 
describe the invention for the purposes of the written description 
requirement of 35 U.S.C. 112. Eli Lilly, 119 F. 3d at 1567, 43 USPQ2d 
at 1405. 

Compare Fonar Corp. v. General Electric Co., 107 F.3d 1543, 
1549, 41 USPQ2d 1801, 1805 (Fed. Cir. 1997) ( " v As a general rule, 
where software constitutes part of a best mode of carrying out an 
invention, description of such a best mode is satisfied by a 
disclosure of the functions of the software. This is because, 
normally, writing code for such software is within the skill of the 
art, not requiring undue experimentation, once its functions have 
been disclosed. * * * Thus, flow charts or source code listings are 
not a requirement for adequately disclosing the functions of 
software . ' ' ) . 

\15\ See, e.g., Fujikawa v. Wattanasin, 93 F.3d 1559, 1571, 39 
USPQ2d 1895, 1905 (Fed. Cir. 1996) (a "laundry list 1 ' disclosure of 
every possible moiety does not constitute a written description of 
every species in a genus because it would not "reasonably lead' 1 
those skilled in the art to any particular species) ; In re Ruschig, 
379 F.2d 990, 995, 154 USPQ 118, 123 (CCPA 1967) (""If n-propylamine 
had been used in making the compound instead of n-butylamine , the 
compound of claim 13 would have resulted. Appellants submit to us, 
as they did to the board, an imaginary specific example patterned on 
specific example 6 by which the above butyl compound is made so that 
we can see what a simple change would have resulted in a specific 
supporting disclosure being present in the present specification. 
The trouble is that there is no such disclosure, easy though it is 
to imagine it. 1 ') (emphasis in original); Purdue Pharma L.P. v. 
Faulding Inc., 230 F.3d 1320, 1328, 56 USPQ2d 1481, 1487 (Fed. Cir. 
2000) ("the specification does not clearly disclose to the skilled 
artisan that the inventors * * * considered the [] ratio to be part 
of their invention * * *. There is therefore no force to Purdue's 
argument that the written description requirement was satisfied 
because the disclosure revealed a broad invention from which the 
[later-filed] claims carved out a patentable portion'*). 

16 35 U.S.C. Sees. 132 and 251. See also In re 

Rasmussen, 650 F.2d 1212, 1214, 211 USPQ 323, 326 (CCPA 1981). See 
Manual of Patent Examining Procedure (MPEP) Sees. 2163.06-2163.07 
(7th Ed., Rev, 1, Feb. 2000) for a more detailed discussion of the 
written description requirement and its relationship to new matter. 

17 The claims as filed in the original specification 

are part of the disclosure and, therefore, if an application as 
originally filed contains a claim disclosing material not found in 
the remainder of the specification, the applicant may amend the 
specification to include the claimed subject matter. In re Benno, 
768 F.2d 1340, 226 USPQ 683 (Fed. Cir. 1985). 

18 See, e.g., In re Lukach, 442 F.2d 967, 169 USPQ 

7 95 (CCPA 1971) (subgenus range was not supported by generic 
disclosure and specific example within the subgenus range) ; In re 
Smith, 458 F.2d 1389, 1395, 173 USPQ 679, 683 (CCPA 1972) (a 
subgenus is not necessarily described by a genus encompassing it and 
a species upon which it reads) . 

19 In re Oda, 443 F.2d 1200, 170 USPQ 260 (CCPA 

1971) . With respect to the correction of sequencing errors in 
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applications disclosing nucleic acid and/or amino acid sequences, it 
is well known that sequencing errors are a common problem in 
molecular biology. See, e.g., Peter Richterich, Estimation of Errors 
in "Raw' DNA Sequences: A Validation Study, 8 Genome Research 251-59 
(1998) . If an application as filed includes sequence information and 
references a deposit of the sequenced material made in accordance 
with the requirements of 37 CFR Sec. 1.801 et seq., amendment may be 
permissible . 

20 Corrections of minor errors in the sequence may be 
possible based on the argument that one of skill in the art would 
have resequenced the deposited material and would have immediately 
recognized the minor error. Deposits made after the filing date can 
only be relied upon to provide support for the correction of 
sequence information if applicant submits a statement in compliance 
with 3 7 CFR Sec. 1.804 stating that the biological material which is 
deposited is a biological material specifically defined in the 
application as filed. 

21 See, e.g., Gentry Gallery, Inc. v. Berkline Corp., 
134 F.3d 1473, 45 USPQ2d 1498 (Fed. Cir. 1998) (claims to . a 
sectional sofa comprising, inter alia, a console and a control means 
were held invalid for failing to satisfy the written description 
requirement where the claims were broadened by removing the location 
of the control means.); Johnson Worldwide Associates v. Zebco Corp., 
175 F.3d 985, 993, 50 USPQ2d 1607, 1613 (Fed. Cir. 1999) (In Gentry 
Gallery, the "court's determination that the patent disclosure did 
not support a broad meaning for the disputed claim terms was 
premised on clear statements in the written description that 
described the location of a claim element—the "control means" --as 
"the only possible location* and that variations were "outside the 
stated purpose of the invention. ' Gentry Gallery, 134 F.3d at 1479, 
45 USPQ2d at 1503. Gentry Gallery, then, considers the situation 
where the patent's disclosure makes crystal clear that a particular 
(i.e., narrow) understanding of a claim term is an "essential 
element of [the inventor's] invention.' 11 ); Tronzo v. Biomet, 156 
F.3d at 1158-59, 47 USPQ2d at 1833 (Fed. Cir. 1998) (claims to 
generic cup shape were not entitled to filing date of parent 
application which disclosed ""conical cup 1 ' in view of the 
disclosure of the 

[ [Page 1109] ] 

parent application stating the advantages and importance of the 
conical shape.). 

22 See Gentry Gallery, 134 F.3d at 1480, 45 USPQ2d at 

1503; In re Sus, 306 F.2d 494, 504, 134 USPQ 301, 309 (CCPA 1962) 
(""[0]ne skilled in this art would not be taught by the written 
description of the invention in the specification that any "aryl or 
substituted aryl radical' would be suitable for the purposes of the 
invention but rather that only certain aryl radicals and certain 
specifically substituted aryl radicals [i.e., aryl azides] would be 
suitable for such purposes.'') (emphasis in original). A claim which 
omits matter disclosed to be essential to the invention as described 
in the specification or in other statements of record may also be 
subject to rejection under 35 U.S.C. 112, para. 1, as not enabling, 
or under 35 U.S.C. 112, para. 2. See In re Mayhew, 527 F.2d 1229, 
188 USPQ 356 (CCPA 1976); In re Venezia, 530 F.2d 956, 189 USPQ 149 
(CCPA 1976); and In re Collier, 397 F.2d 1003', 158 USPQ 266 (CCPA 
1968). See also MPEP Sec. 2172.01. 

23 See, e.g., Vas-Cath, Inc., 935 F.2d at 1563-64, 19 
USPQ2d at 1117. 
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24 Wertheim, 541 F.2d at 262, 191 USPQ at 96. 

25 See MPEP Sees. 714.02 and 2163.06 ("Applicant 

should * * * specifically point out the support for any amendments 
made to the disclosure.''); and MPEP Sec. 2163.04 (""If applicant 
amends the claims and points out where and/or how the originally 
filed disclosure supports the amendment ( s ) , and the examiner finds 
that the disclosure does not reasonably convey that the inventor had 
possession of the subject matter of the amendment at the time of the 
filing of the application, the examiner has the initial burden of 
presenting evidence or reasoning to explain why persons skilled in 
the art would not recognize in the disclosure a description of the 
invention defined by the claims. ' ') . 

26 See In re Smith, 458 F.2d 1389, 1395, 173 USPQ 

679, 683 (CCPA 1972) (""Precisely how close [to the claimed 
invention] the description must come to comply with Sec. 112 must be 
left to case-by-case development.' 1 ); In re Wertheim, 541 F.2d at 
262, 191 USPQ at 96 (inquiry is primarily factual and depends on the 
nature of the invention and the amount of knowledge imparted to 
those skilled in the art by the disclosure) . 

27 See, e.g., In re Morris, 127 F.3d 1048, 1053-54, 
44 USPQ2d 1023, 1027 (Fed. Cir. 1997). 

28 ""Preamble language ' 1 is that language in a claim 
appearing before the transitional phase, e.g., before 
""comprising, 1 ' ""consisting essentially of, ' ' or ""consisting of . ' ' 

29 The transitional term " "comprising 1 * (and other 
comparable terms, e.g., ""containing,'' ""including,' 1 and 
""having 1 ') is " "open-ended- -it covers the expressly recited subject 
matter, alone or in combination with unrecited subject matter. See, 
e.g., Genentech, Inc. v. Chiron Corp., 112 F.3d 495, 501, 42 USPQ2d 
1608, 1613 (Fed. Cir. 1997) ("" "Comprising' is a term of art used 
in claim language which means that the named elements are essential, 
but other elements may be added and still form a construct within 
the scope of the claim. •') ; Ex parte Davis, 80 USPQ 448, 450 (Bd. 
App. 1948) (""comprising 1 ' leaves the ""claim open for the inclusion 
of unspecified ingredients even in major amounts' 1 ) . ""By using the 
term "consisting essentially of, ' the drafter signals that the 
invention necessarily includes the listed ingredients and is open to 
unlisted ingredients that do not materially affect the basic and 
novel properties of the invention. A "consisting essentially of 
claim occupies a middle ground between closed claims that are 
written in a "consisting of format and fully open claims that- are 
drafted in a "comprising* format.' 1 PPG Industries v. Guardian 
Industries, 156 F.3d 1351, 1354, 48 USPQ2d 1351, 1353-54 (Fed. Cir. 
1998) . For the purposes of searching for and applying prior art 
under 35 U.S.C. 102 and 103, absent a clear indication in the 
specification or claims of what the basic and novel characteristics 
actually are, "consisting essentially of will be construed as 
equivalent to ""comprising. 1 ' See, e.g., PPG, 156 F.3d at 1355, 48 
USPQ2d at 1355 {""PPG could have defined the scope of the phrase 
""consisting essentially of' for purposes of its patent by making 
clear in its specification what it regarded as constituting a 
material change in the basic and novel characteristics of the 
invention.''). See also In re Janakirama-Rao, 317 F.2d 951, 954, 137 
USPQ 893, 895-96 (CCPA 1963) . If an applicant contends that 
additional steps or materials in the prior art are excluded by the 
recitation of ""consisting essentially of,'' applicant has the 
burden of showing that the introduction of additional steps or 
components would materially change the characteristics of 
applicant's invention. In re De Lajarte, 337 F.2d 870, 143 USPQ 256 
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(CCPA 1964) . 

30 See Pac-Tec Inc. v. Amerace Corp., 903 F.2d 796, 

801, 14 USPQ2d 1871, 1876 (Fed. Cir. 1990) (determining that 
preamble language that constitutes a structural limitation is 
actually part of the claimed invention) . 

31 An applicant shows possession of the claimed 
invention by describing the claimed invention with all of its 
limitations. Lockwood, 107 F.3d at 1572, 41 USPQ2d at 1966. 

32 See, e.g., Bell Communications Research, Inc. v. 
Vitalink Communications Corp., 55 F.3d 615, 620, 34 USPQ2d 1816, 
1820 (Fed. Cir. 1995) (""[A] claim preamble has the import that the 
claim as a whole suggests for it. 1 *); Corning Glass Works v. 
Sumitomo Elec . U.S.A., Inc., 868 F.2d 1251, 1257, 9 USPQ2d 1962, 
1966 (Fed. Cir. 1989) (The determination of whether preamble 
recitations are structural limitations can be resolved only on 
review of the entirety of the application ""to gain an understanding 
of what the inventors actually invented and intended to encompass by 
the claim. 1 • ) . 

33 An element may be critical where those of skill in 
the art would require it to determine that applicant was in 
possession of the invention. Compare Rasmussen, 650 F.2d at 1215, 
211 USP.Q at 327 ("one skilled in the art who read Rasmussen* s 
specification would understand that it is unimportant how the layers 
are adhered, so long as they are adhered 1 ') (emphasis in original), 
with Amgen, Inc. v. Chugai Pharmaceutical Co., Ltd., 927 F.2d 1200, 
1206, 18 USPQ2d 1016, 1021 (Fed. Cir. 1991) ("it is well 
established in our law that conception of a chemical compound 
requires that the inventor be able to define it so as to distinguish 
it from other materials, and to describe how to obtain it' ') . 

34 See, e.g., Wang Labs. v. Toshiba Corp., 993 F.2d 
858, 865, 26 USPQ2d 1767, 1774 (Fed. Cir. 1993). 

35 See, e.g., Hybritech, Inc. v. Monoclonal 

Antibodies, Inc., 802 F.2d 1367, 1379-80, 231 USPQ 81, 90 (Fed. Cir. 
1986) . 

36 See, e.g., Purdue Pharma L.P. v. Faulding Inc., 

230 F.3d 1320, , 56 USPQ2d 1481, 1483 (Fed. Cir. 2000) (the 

written description "inquiry is a factual one and must be assessed 
on a case-by-case basis''); see also Pfaff v. Wells Electronics, 
Inc., 55 U.S. at 66, 119 S . Ct . at 311, 48 USPQ2d at 1646 (""The word 
"invention* must refer to a concept that is complete, rather than 
merely one that is "substantially complete. ' It is true that 
reduction to practice ordinarily provides the best evidence that an 
invention is complete. But just because reduction to practice is 
sufficient evidence of completion, it does not follow that proof of 
reduction to practice is necessary in every case. Indeed, both the 
facts of the Telephone Cases and the facts of this case demonstrate 
that one can prove that an invention is complete and ready for 
patenting before it has actually been reduced to practice.' ') . 

37 Cooper v. Goldfarb, 154 F.3d 1321, 1327, 47 USPQ2d 
1896, 1901 (Fed. Cir. 1998). See also UMC Elecs. Co. v. United 
States, 816 F.2d 647, 652, 2 USPQ2d 1465, 1468 (Fed. Cir. 1987) 
(""[T]here cannot be a reduction to practice of the invention * * * 
without a physical embodiment which includes all limitations of the 
claim.' »); Estee Lauder Inc. v. L'Oreal, S.A., 129 F.3d 588, 593, 44 
USPQ2d 1610, 1614 (Fed. Cir. 1997) (""[A] reduction to practice does 
not occur until the inventor has determined that the invention will 
work for its intended purpose.*'); Mahurkar v. C.R. Bard, Inc., 79 
F.3d 1572, 1578, 38 USPQ2d 1288, 1291 {Fed. Cir. 1996) (determining 
that the invention will work for its intended purpose may require 
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testing depending on the character of the invention and the problem 
it solves) . 

38 37 CFR 1.804, 1.809. See also endnote 6. 

39 See, e.g., Vas-Cath, 935 F.2d at 1565, 19 USPQ2d 

at 1118 (""drawings alone may provide a "written description' of an 
invention as required by Sec. 112 ' * ) ; In re Wolf ensperger , 302 F.2d 
950, 133 USPQ 537 (CCPA 1962) (the drawings of applicant 1 s 
specification provided sufficient written descriptive support for 
the claim limitation at issue); Autogiro Co. of America v. United 
States, 384 F.2d 391, 398, 155 USPQ 697, 703 (Ct. CI. 1967) C"In 
those instances where a visual representation can flesh out words, 
drawings may be used in the same manner and with the same 
limitations as the specification. 1 '). 

40 See, e.g., Eli Lilly, 119 F.3d at 1568, 43 USPQ2d 

at 1406 (""In claims involving chemical materials, generic formulae 
usually indicate with specificity what the generic claims encompass. 
One skilled in the art can distinguish such a formula from others 
and can identify many of the species that the claims encompass. 
Accordingly, such a formula is normally an adequate description of 
the claimed genus . ' ' ) . 
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41 See Hybritech v. Monoclonal Antibodies, 802 F.2d 

at 1384, 231 USPQ at 94; Fonar Corp. v. General Electric Co., 107 
F.3d at 1549, 41 USPQ2d at 1805 (source code description not 
required) . 

42 For example, the presence of a restriction enzyme 

map of a gene may be relevant to a statement that the gene has been 
isolated. One skilled in the art may be able to determine when the 
gene disclosed is the same as or different from a gene isolated by 
another by comparing the restriction enzyme map. In contrast, 
evidence that the gene could be digested with a nuclease would not 
normally represent a relevant characteristic since any gene would be 
digested with a nuclease. Similarly, isolation of an mRNA and its 
expression to produce the protein of interest is strong evidence of 
possession of an mRNA for the protein. 

For some biomolecules , examples of identifying characteristics 
include a sequence, structure, binding affinity, binding 
specificity, molecular weight, and length. Although structural 
formulas provide a convenient method of demonstrating possession of 
specific molecules, other identifying characteristics or 
combinations of characteristics may demonstrate the requisite 
possession. For example, unique cleavage by particular enzymes, 
isoelectric points of fragments, detailed restriction enzyme maps, a 
comparison of enzymatic activities, or antibody cross-reactivity may 
be sufficient to show possession of the claimed invention to one of 
skill in the art. See Lockwood, 107 F.3d at 1572, 41 USPQ2d at 1966 
(""written description' 1 requirement may be satisfied by using 
""such descriptive means as words, structures, figures, diagrams, 
formulas, etc., that fully set forth the claimed invention 1 '). 

43 A definition by function alone ""does not 

suffice' ' to sufficiently describe a coding sequence ""because it is 
only an indication of what the gene does, rather than what it is.' 1 
Eli Lilly, 119 F.3 at 1568, 43 USPQ2d at 1406. See also Fiers, 984 
F.2d at 1169-71, 25 USPQ2d at 1605-06 (discussing Amgen Inc. v. 
Chugai Pharmaceutical Co., 927 F.2d 1200, 18 USPQ2d 1016 (Fed. Cir. 
1991) ) . 

44 If a claim limitation invokes 35 U.S.C. 112, para. 
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6, it must be interpreted to cover the corresponding structure, 
materials, or acts in the specification and ""equivalents thereof.' 1 
See 35 U.S.C. 112, para. 6. See also B. Braun Medical, Inc. v. 
Abbott Lab., 124 F.3d 1419, 1424, 43 USPQ2d 1896, 1899 (Fed. Cir. 
1997). In considering whether there is 35 U.S.C. 112, para. 1, 
support for a means- (or step) plus -function claim limitation, the 
examiner must consider not only the original disclosure contained in 
the summary and detailed description of the invention portions of 
the specification, but also the original claims, abstract, and 
drawings. A means- (or step-) plus -function claim limitation is 
adequately described under 35 U.S.C. 112, para. 1, if: (1) The 
written description adequately links or associates adequately 
described particular structure, material, or acts to the function 
recited in a means- (or step-) plus-function claim limitation; or 
(2) it is clear based on the facts of the application that one 
skilled in the art would have known what structure, material, or 
acts perform the function recited in a means- (or step-) plus- 
function limitation. Note also: A rejection under 35 U.S.C. 112, 
para. 2, ""cannot stand where there is adequate description in the 
specification to satisfy 35 U.S.C. 112, first paragraph, regarding 
means-plus-function recitations that are not, per se, challenged for 
being unclear. ■ • In re Noll, 545 F.2d 141, 149, 191 USPQ 721, 727 
(CCPA 1976) . See Supplemental Examination Guidelines for Determining 
the Applicability of 35 U.S.C. 112, para. 6, 65 FR 38510, June 21, 
2000 . 

45 See Hybritech Inc. v. Monoclonal Antibodies, Inc., 
802 F.2d at 1384, 231 USPQ at 94. 

46 See, e.g., Vas-Cath, 935 F.2d at 1563, 19 USPQ2d 

at 1116; Martin v. Johnson, 454 F.2d 746, 751, 172 USPQ 391, 395 
(CCPA 1972) (stating ""the description need not be in ipsis verbis 
[i.e., ""in the same words 1 ' ] to be suf f icient * * ) . 

47 A claim which is limited to a single disclosed 
embodiment or species is analyzed as a claim drawn to a single 
embodiment or species, whereas a claim which encompasses two or more 
embodiments or species within the scope of the claim is analyzed as 
a claim drawn to a genus. See also MPEP Sec. 806.04(e) . 

48 35 U.S.C. 112, para. 1. Cf . Fields v. Conover, 443 

F.2d 1386, 1392, 170 USPQ 276, 280 (CCPA 1971) (finding a lack of 
written description because the specification lacked the ""full, 
clear, concise, and exact written description' ' which is necessary 
to support the claimed invention) . 

49 For example, if the art has established a strong 
correlation between structure and function, . one skilled in the art 
would be able to predict with a reasonable degree of confidence the 
structure of the claimed invention from a recitation of its 
function. Thus, the written description requirement may be satisfied 
through disclosure of function and minimal structure when there is a 
well-established correlation between structure and function. In 
contrast, without such a correlation, the capability to recognize or 
understand the structure from the mere recitation of function and 
minimal structure is highly unlikely. In this latter case, 
disclosure of function alone is little more than a wish for 
possession; it does not satisfy the written description requirement. 
See Eli Lilly, 119 F.3d at 1568, 43 USPQ2d at 1406 (written 
description requirement not satisfied by merely providing ""a result 
that one might achieve if one made that invention' ') ; In re Wilder, 
736 F.2d 1516, 1521, 222 USPQ 369, 372-73 (Fed. Cir. 1984) 
(affirming a rejection for lack of written description because the 
specification does ""little more than outline goals appellants hope 
the claimed invention achieves and the problems the invention will 
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hopefully ameliorate''). Compare Fonar, 107 F.3d at 1549, 41 USPQ2d 
at 1805 (disclosure of software function adequate in that art) . 

50 See Eli Lilly, 119 F.3d at 1568, 43 USPQ2d at 
1406. 

51 See, e.g., In re Hayes Microcomputer Products, 

Inc. Patent Litigation, 982 F.2d 1527, 1534-35, 25 USPQ2d 1241, 1246 
(Fed. Cir. 1992) ("One skilled in the art would know how to program 
a microprocessor to perform the necessary steps described in the 
specification. Thus, an inventor is not required to describe every 
detail of his invention. An applicant's disclosure obligation varies 
according to the art to which the invention pertains. Disclosing a 
microprocessor capable of performing certain functions is sufficient 
to satisfy the requirement of section 112, first paragraph, when one 
skilled in the relevant art would understand what is intended and 
know how to carry it out . ' * ) 

52 See, e.g., Fiers v. Revel, 984 F.2d at 1169, 25 

USPQ2d at 1605; Amgen. , 927 F.2d at 1206, 18 USPQ2d atl021. Where 
the process has actually been used to produce the product, the 
written description requirement for a product -by-process claim is 
clearly satisfied; however, the requirement may not be satisfied 
where it is not clear that the acts set forth in the specification 
can be performed, or that the product is produced by that process. 

53 See, e.g., Amgen, 927 F.2d atl206, 18 USPQ2d at 

1021 ("A gene is a chemical compound, albeit a complex one, and it 
is well established in our law that conception of a chemical 
compound requires that the inventor be able to define it so as to 
distinguish it from other materials, and to describe how to obtain 
it. Conception does not occur unless one has a mental picture of the 
structure of the chemical, or is able to define it by its method of 
preparation, its physical or chemical properties, or whatever 
characteristics sufficiently distinguish it. It is not sufficient to 
define it solely by its principal biological property, e.g., 
encoding human erythropoietin, because an alleged conception having 
no more specificity than that is simply a wish to know the identity 
of any material with that biological property. We hold that when an 
inventor is unable to envision the detailed constitution of a gene 
so as to distinguish it from other materials, as well as a method 
for obtaining it, conception has not been achieved until reduction . 
to practice has occurred, i.e., until after the gene has been 
isolated.' 1 ) (citations omitted). In such instances the alleged 
conception fails not merely because the field is unpredictable or 
because of the general uncertainty surrounding experimental 
sciences, but because the conception is incomplete due to factual 
uncertainty that undermines the specificity of the inventor's idea 
of the invention. Burroughs Wellcome Co. v. Barr Laboratories Inc., 
40 F.3d 1223, 1229, 32 USPQ2d 1915, 1920 (Fed. Cir. 1994). Reduction 
to practice in effect provides the only evidence to corroborate 
conception (and therefore possession) of the invention. Id. 

54 See Eli Lilly, 119 F . 3d at 1568, 43 USPQ2d at 
1406. 

55 See, e.g., Rasmussen, 650 F.2d at 1214, 211 USPQ 

at 326-27 (disclosure of a single method of adheringly applying one 
layer to another was sufficient to support a generic claim to 
""adheringly applying'' because one skilled in the art reading the 
specification would understand that it is unimportant how the layers 
are adhered, so long as they are adhered); In re Herschler, 591 F.2d 
693, 697, 200 USPQ 711, 714 (CCPA 1979) (disclosure of 
corticosteriod in DMSO sufficient to support claims drawn to a 
method of using a mixture of a ""physiologically active steroid'' 
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and DMSO because "use of known chemical compounds in a manner 
auxiliary 

[ [Page 1111] ] 

to the invention must have a corresponding written description only 
so specific as to lead one having ordinary skill in the art to that 
class of compounds. Occasionally, a functional recitation of those 
known compounds in the specification may be sufficient as that 
description. 1 '); In re Smythe, 480 F.2d 1376, 1383, 178 USPQ 279, 
285 (CCPA 1973) (the phrase ""air or other gas which is inert to the 
liquid' * was sufficient to support a claim to ""inert fluid media 1 ' 
because the description of the properties and functions of the air 
or other gas segmentizing medium would suggest to a person skilled 
in the art that appellant's invention includes the use of ""inert 
fluid' ' broadly.). However, in Tronzo v. Biomet, 156 F.3d at 1159, 
47 USPQ2d atl833 (Fed. Cir. 1998), the disclosure of a species in 
the parent application did not suffice to provide written 
description support for the genus in the child application. 

56 See, e.g., Eli Lilly. 

57 For example, in the molecular biology arts, if an 
applicant disclosed an amino acid sequence, it would be unnecessary 
to provide an explicit disclosure of nucleic acid sequences that 
encoded the amino acid sequence. Since the genetic code is widely 
known, a disclosure of an amino acid sequence would provide 
sufficient information such that one would accept that an applicant 
was in possession of the full genus of nucleic acids encoding a 
given amino acid sequence, but not necessarily any particular 
species. Cf. In re Bell, 991 F.2d 781, 785, 26 USPQ2d 1529, 1532 
(Fed. Cir. 1993) and In re Baird, 16 F.3d 380, 382, 29 USPQ2d 1550, 
1552 (Fed. Cir. 1994) . 

58 See Wertheim, 541 F.2d at 263, 191 USPQ at 97 
(""[T]he PT0 has the initial burden of presenting evidence or 
reasons why persons skilled in the art would not recognize in the 
disclosure a description of the invention defined by the claims. 1 '). 

59 See MPEP Sees. 714.02 and 2163.06 (""Applicant 

should * * * specifically point out the support for any amendments 
made to the disclosure. ' ' ) . 

60 See, e.g., In re Wright, 866 F.2d 422, 425, 9 

USPQ2d 1649, 1651 (Fed. Cir. 1989) (Original specification for 
method of forming images using photosensitive microcapsules which 
describes removal of microcapsules from surface and warns that 
capsules not be disturbed prior to formation of image, unequivocally 
teaches absence of permanently fixed microcapsules and supports 
amended language of claims requiring that microcapsules be ""not 
permanently fixed' 1 to underlying surface, and therefore meets 
description requirement of 35 U.S.C. 112.). 

61 See, e.g., In re Robins, 429 F.2d 452, 456-57, 166 

USPQ 552, 555 (CCPA 1970) ( " " [W] here no explicit description of a 
generic invention is to be found in the specification * * * mention 
of representative compounds may provide an implicit description upon 
which to base generic claim language .' 1 ) ; In re Smith, 458 F.2d 
1389, 1395, 173 USPQ 679, 683 (CCPA 1972) (a subgenus is not 
necessarily implicitly described by a genus encompassing it and a 
species upon which it reads) . 

62 See, e.g., In re Robertson, 169 F.3d 743, 745, 49 

USPQ2d 1949, 1950-51 (Fed. Cir. 1999) (""To establish inherency, the 
extrinsic evidence ""must make clear that the missing descriptive 
matter is necessarily present in the thing described in the 
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reference, and that it would be so recognized by persons of ordinary- 
skill. Inherency, however, may not be established by probabilities 
or possibilities. The mere fact that a certain thing may result from 
a given set of circumstances is not suf f icient . 1 * ' * ) (citations 
omitted) . 

63 When an explicit limitation in a claim "is not 

present in the written description whose benefit is sought it must 
be shown that a person of ordinary skill would have understood, at 
the time the patent application was filed, that the description 
requires that limitation. 1 1 Hyatt v. Boone, 146 F.3d 1348, 1353, 47 
USPQ2d 1128, 1131 (Fed. Cir. 1998) . 

64 See, e.g., Johnson Worldwide Associates Inc. v. 

Zebco Corp., 175 F.3d at 993, 50 USPQ2d at 1613; Gentry Gallery, 
Inc. v. Berkline Corp., 134 F.3d at 1479, 45 USPQ2d at 1503; Tronzo 
v. Biomet, 156 F.3d at 1159, 47 USPQ2d at 1833. 

65 See, e.g., In re Marzocchi, 439 F.2d 220, 224, 169 
USPQ 367, 370 (CCPA 1971) . 

66 Wertheim, 541 F.2d at 263, 191 USPQ at 97. 

67 See Rasmussen, 650 F.2d at 1214, 211 USPQ at 326. 

68 See In re Alton, 76 F.3d 1168, 1176, 37 USPQ2d 
1578, 1584 (Fed. Cir. 1996). 
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High-throughput technologies, such as proteomic screening and DNA micro-arrays, produce vast 
amounts of data requiring comprehensive analytical methods to decipher the biologically relevant 
results. One approach would be to manually search the biomedical literature; however, this would be 
an arduous task. We developed an automated literature-mining tool, termed MedGene, which 
comprehensively summarizes and estimates the relative strengths of all human gene-disease 
relationships in Medline. Using MedGene, we analyzed a novel micro-array expression dataset 
comparing breast cancer and normal breast tissue in the context of existing knowledge. We found no 
correlation between the strength of the literature association and the magnitude of the difference in 
expression level when considering changes as high as 5-fold; however, a significant correlation was 
observed (r = 0.41; p = 0.05) among genes showing an expression difference of 10-fold or more. 
Interestingly, this only held true for estrogen receptor (ER) positive tumors, not ER negative. MedGene 
identified a set of relatively understudied, yet highly expressed genes in ER negative tumors worthy of 
further examination. 

Keywords: bioinformatics * micro-array • text mining • gene-disease association • breast cancer 



Introduction 

At its current pace, the accumulation of biomedical literature 
outpaces the ability of most researchers and clinicians to stay 
abreast of their own immediate fields, let alone cover a broader 
range of topics. For example, to follow a single disease, e.g.. 
breast cancer, a researcher would have had to scan 130 different 
journals and read 27 papers per day in 1999.' This problem is 
accentuated with high-throughput technologies such as DNA 
micro-arrays and proteomics, which require the analysis of 
large daiasets involving thousands of genes, many of which are 
unfamiliar to a particular researcher. In any microarray experi- 
ment, thousands of genes may demonstrate statistically sig- 
nificant expression changes, but only a fraction of these may 
be relevant to the~study7The ability toTnTerpret these daTasets™ 
would be enhanced if they could be compared to a compre- 
hensive summary of what is known about all genes. Thus, there 
is a need to summarize existing knowledge in a format that 
allows for the rapid analysis of associations between genes and 
diseases or other specific biological concepts. 

One solution to this problem is to compile structured digital 
resources, such as the Breast Cancer Gene Database 1 and the 
Tumor Gene Database. 2 However, as these resources are hand- 
cu rated, the labor-intensive review process becomes a rate- 
limiting step in the growth of the database. As a result, these 

* To whom correspondence should be addressed; jlabaer@hms.harvard.edu. 
10-102l/pr0340227 CCC: S25.00 © 2003 American Chemical Society 



databases have a limited scale and the genes are not selected 
in a systematic fashion. 

An alternative approach is automated text mining; a method 
which involves automated information extraction by searching 
documents for text strings and analyzing their frequency and 
context. This approach has been used successfully in several 
instances for biological applications. In most cases, it has been 
applied to extract information about the relationships or 
interactions that proteins or genes have with one another, in 
the literature or by functional annotation. 3-7 Thus far, few 
publication have applied text-mining to examine the global 
relationships between genes and diseases. Perez- Iratxeta etal. 
automatically examined the GO (Gene Ontology) annotation 
of genes and their predicted chromosomal locations in order 
to identify genes linked ^inherited disorders' 8 ^" ~~ 

To obtain a.morc global understanding of disease develop- 
ment, it would be valuable to incorporate information regarding 
all possible gehe-disease relationships, including biochemical, 
physiological, pharmacological, epidemiological, as well as 
genetic. This information would enable comprehensive com- 
parisons between large experimental daiasets and existing 
knowledge in the literature. This would accomplish two things. 
First, it would serve to validate experiments by demonstrating 
that known responses occur as predicted. Second, It would 
rapidly highlight which genes are corroborated by the literature 
and which genes are novel in a given context. We have utilized 
a computational approach to literature mining to produce a 
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comprehensive set of gene-disease relationships. In addition, 
we have developed a novel approach to assess the strength of 
each association based on the frequency of citation and co- 
citation. We applied this tool to help interpret the data from a 
large micro-array gene expression experiment comparing 
normal and cancerous breast tissue. 

Methods 

MedGene Database. MedCene is a relational database, stor- 
ing disease and gene information from NCBI, text mining re- 
sults, statistical scores, and hyperlinks to the primary lit- 
erature. MedGene has a web-based user interface for users to 
query the database (http://hipseq.med.harvard.edu/MedGene/). 

Text Mining Algorithms. MeSH files were downloaded from 
the MeSH web site at NLM (Nation Library of Medicine) (http:// 
www.nlm.nih.gov/mesh/meshhome.html) and human disease 
categories were selected. LocusLink files were downloaded from 
the LocusLink web site at NCBI (http://www,ncbi. nih.gov/ 
LocusLink/). Official/preferred gene symbol, official/preferred 
gene name, and gene alternative symbols and names, all 
relevant annotations and URLs for each LocusLink record, were 
collected. Gene search terms were used for literature searching 
and included all qualified gene names, gene symbols, and gene 
family terms. Primary gene keys, predominantly qualified gene 
family terms and gene official/preferred symbols, were used 
to Index Medline records. If the official/preferred gene symbols 
did not meet the standards to be an index, then qualified gene 
official/preferred names were used. A local copy of Medline 
records (up to July, 2002) was pre-selected. 

A JAVA module examined the MeSH terms and then indexed 
each Medline record with the appropriate disease terms. A 
separate JAVA module was used to examine the titles and 
abstracts for gene search terms and then to index the gene- 
related Medline records with the relevant primary gene key(s). 

Statistical Methods. For every gene and disease pair, we 
counted records that were indexed for both gene and disease 
(double positive hits), for disease only (disease single hits), for 
gene- only (gene single hits), and for neither gene nor disease 
(double negative hits) to generate a 2 x 2 contingency table. 
On the basis of the contingency table-framework, we applied 
different statistical methods to estimate the strength of gene- 
disease relationships and evaluated the results. These methods 
included chi-square analysis. Fisher's exact probabilities, rela- 
tive risk of gene, and relative risk of disease 16 (http;// 
hipseq.med.harvard.edu/MedGene/). In addition, we computed 
the "product of frequency", which is the product of the 

* proportion of disease/gene-double hits to disease single-hits 

and the proportion of disease/gene double hits to gene single 
hits. To obtain a normal distribution, wc transformed all the 
statistical scores using the natural logarithm. We selected the 
log of the product of frequency (LPF) to validate MedGene and 
to use for the analysis with the micro-array data. Spearman 
rank-correlation coefficients were used to assess the linear 
relationship between LPF and micro-array fold change in 
expression level. 

Global Analysis. Diseases with at least 50 related genes were 
selected for clustering analysis, and the LPF scores were 
normalized with total score for each disease. Hierarchical 
clustering was done with the "Cluster" software and the 
clustering result was visualized using TreeViewer" (http:// 
rana.lbl.gov/EisenSoftware.htm). 
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Breast Tissue Micro-Arrays. Eighty-nine breast cancer 
samples (79% ER- positive) and 7 normal breast tissue samples • 
were selected from the Harvard Breast SPORE frozen tissue 
repository and were representative of the spectrum of histo- 
logical types, grades, and hormone receptor immuno-pheno- 
types of breast cancer. Biotinylated cRNA. generated from th» 
total RNA extracted from the bulk tumor, was hybridized to 
Asymetrix U95A oligo-nucleotide micro-arrays. These micro- 
arrays consist of 12 400 probes, which represent approximately 
9000 genes. Raw expression values were obtained using GENE- 
CHIP software from AITymetrix. and then further analyzed using 
the DNA-Chip Analyzer (dChip) custom software. 

Results 

Automated Indexing of Medline Records by Disease and 
Gene. To study the gene-disease associations in the literature, 
we first compiled complete lists for human diseases and human 
genes. To index all Medline records that were relevant to 
human diseases, the Medical Subject Heading (MeSH) index 
of Medline records was utilized. MeSH is a controlled medical 
vocabulary from the National Library of Medicine and consists 
of a set of terms or subject headings that are arranged in both 
an alphabetic and an hierarchical structure. Medline records 
are reviewed manually and MeSH terms are added to each with 
software assistance. 9 * 10 Twenty-three human disease category 
headings along with all of their child terms (see the Supporting 
Information, Supplemental Table 1. or visit http://hipseq. 
med.harvard.edu/MedGene/publication/s_Table l.html) were 
selected from the 2002 MeSH index creating a list of 4033 
human diseases. 

No index comparable to the MeSH index exists for genes, 
and thus, it was necessary to apply a string search algorithm 
for gene names or symbols found in Medline text. A complete 
list of genes, gene names, gene symbols, and frequently used 
synonyms were collected from the LocusLink database at 
NCBI. 11,2 which contains 53 259 independent records keyed 
by an official gene symbol or name (June 18 ,h . 2002). For the 
purposes of this study, no distinction was made between genes 
and their gene products. Authors often use the same name for 
both, differentiating the two only by the use of italics, if at ail. 
For the intended use of this study, this lack of distinction is 
unlikely to have a large effect and may in fact be beneficial. 

Initial attempts to search the literature using these lists 
revealed several sources of false positives and false negatives 
(Table I). False positives primarily arose when the searched 
term had other meanings, whereas false negatives arose from 
syntax discrepancies necessitating the development of filters 
—to reduce these errors. _The syntax Jssues^erej^adUyJiandle^ _ 
by including alternate syntax forms in the search terms. The 
false positive cases, caused by duplicative and unrelated 
meanings for the terms, were more difficult to manage. Where 
possible, case sensitive string mapping reduced inappropriate 
citations. In many cases, however, this was not sufficient and 
the terms had to be eliminated entirely, thereby reducing the 
false positive rate but unavoidably under-representing some 
genes. 

For the purposes of data tracking, a primary gene key was 
selected to represent all synonyms that correspond to each 
gene. Medline records were indexed with a primary gene key 
when any synonym for that key was found in the title or 
abstract. Case-insensitive string mapping was used for all 
searches except as noted above. No additional weight was 



Analysis of Data Using Advan^^terature Mining 

Table 1. Systematic Sources of False Positives and False Negatives in Unfiltered Data* 



research articles 



source of error 



error type 



example 



filler solution 



gene symbol/name 
is not unique 



gene symbol is 

unrelated abbreviation 
genesymbol/name 

has language meaning 
nonstandard syntax 
unofficial gene name/symbol 
nonspeclfled gene name 



false positive MAC— myelin 

associuied glycoprotein 
MAG~ malignancy-associated 
protein 

false positive /M — pallid homologue (mouse). 

palltdin (also abbrev, for Pennsylvania) 
false positive lY/15-Wiskoit- Aldrich Syndrome 

(also the word "was") 
false negative BAG- 1 instead of BAG I 
false negative P53 instead of TP53 
false negative estrogen receptor instead of 

Estrogen receptor 1 



eliminate this term 

eliminate this term 

case-sensitive string search 

add dash term 

add all gene nicknames 

add family stem term 



* In preliminary studies. Medline was searched for co-occurrence of genes and diseases and the resulting output was evaluated to identify error sources thai 
were amenable to global filters, iach error source is categorized by the type of error it causes: false positives are suggested relationships that arc not real and 
false negatives are real relationships that are underre presented. The filter solutions used are Indicated. Note that In some cases, the filler solution itself Introduces 
error. In general, error rates maximized sensitivity, even at the expense of specificity If needed. 



added for multiple occurrences of a term or the co-occurrence 
of multiple synonyms for the same gene key. 

Medline records were searched with all qualified gene 
identifiers, such as the official/preferred gene symbol, the 
official/preferred gene name, all gene nicknames and all syntax 
variants. In situations where there are several members of a 
gene family or splice variants, some authors prefer to use a 
shortened gene family name, e.g., estrogen receptor instead of 
estrogen receptor 1 {ESR1), creating a source of false negatives. 
For this reason, gene family stem terms were created for all 
genes that have an alpha or numerical suffix (e.g.. IL2RA, TGFfi, 
ESRL etc.) and then used to search the literature. The family 
stem terms were handled separately from the specific gene 
names so that it would be clear when linkages were made to 
the gene family versus a specific member in that family. 

To improve performance and accuracy, some prc-selection 
was applied to the records that were scanned. First, review 
articles were eliminated to avoid redundant treatment of 
citations. Second, non-English journals were removed because 
the natural language filters were only relevant to English 
publications. Finally, journals unlikely to contain primary data 
about gene-disease relationships were also removed (e.g., Int. 
J. Health Educ, Bedside Nurse, and /. Health Econ). Together, 
these filters reduced the 12 198 221 Medline publications (July 
2002) by 37%. 

Ranking the Relative Strengths of Gene-Disease Associa- 
tions. In total, there were 618 708 gene-disease co-citations, 
in which 16% (8297) of all studied genes had been associated 
to a disease and 96% (3875) of all diseases had been associated 
to at least one gene. To rank the relative strengths of gene 
disease reiafionsHips, we tested severar different~statistical 
methods and examined the results. With the exception of the 
relative risk estimates, the methods provided similar results 
with respect to the rank order of the gene-disease association 
strengths. However, after comparing the results to other 
databases and after consulting disease experts, the log of the 
product of frequency (LPF) was selected for further analysis 
because it gave the best results overall. 

Validation of MedGene. In developing this tool, it was 
important to minimize the number of missed genes (false 
negatives) and miscalled genes (false positives). However, in 
situations when these goals were in conflict, inclusiveness was 
prioritized. To determine the false negative rate in MedGene, 
breast cancer was used as a test case because it was associated 
with more genes than any other human disease and because 




Figure 1. Estimation of the false negative rate by comparison 
with hand-curated databases. The breast cancer-related genes 
identified by MedGene were compared with those listed in 
several other databases including the Tumor Gene Database 
(TGD). 2 the Breast Cancer Gene Database(BCG). 1 GeneCards 
(GC) 17 and Swissprot, 18 Genes were considered false negatives 
if they were represented in at least one of these other databases 
and not in MedGene and their link to breast cancer was sup- 
ported, by at least.one.literature reference . AlUiterature references^ 
were verified by manual review to confirm their validity. The 
number of genes in each database or shared by more than one 
database is indicated. The false negative rate was calculated by 
genes missed at MedGene (26)/total number of nonoverlapping 
genes in other databases (285). 

there were several public databases that link genes to breast 
cancer. We compared the list of breast cancer-related genes 
from MedGene to these databases, illustrated in Figure 1. 
Among the 285 distinct breast cancer- related genes that were 
supported by at least one literature citation in these hand- 
curated databases. 26 were absent from MedGene. suggesting 
a false negative rate of approximately 9%. To determine why 
these were missed, all literature references for these genes (80 
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papers) were reviewed manually (see the Supporting Informa- 
tion. Supplemental Table 2, or visit http://hipseq.med. 
harvard.edu/MedGene/pubiicaiion/s_Table 2.html). Among 
these papers, most false negatives were caused by nonstandard 
gene terms or gene terms eliminaied by our specificity filters. 
Few genes were missed because they were only mentioned in 
review papers (0.4%) or they appeared only in the body of the 
manuscript but not the abstract or title (1.1%). Of note. 
MedGene identified approximately 2000 additional breast 
cancer- related genes not listed in any other database. 

To assess the false positive error rate, two complementary 
approaches were used: a detailed analysis of one disease and 
a global examination of 1000 diseases. The detailed approach 
examined the false positive error rate and its sources, whereas 
the global approach tested whether the overall results made 
biomedical sense. 

Using the LPF, 1467 genes related to prostate cancer were 
assembled in rank order. We then retrieved approximately 300 
Medline records each for the highest ranked 100 and the lowest 
ranked 200 genes and manually reviewed the titles and 
abstracts to determine the verity of the association. Nearly 80% 
of the highest ranked 100 genes fell into one of the five 
categories that reflect meaningful gene-disease relationships 
(see the Supporting Information. Supplemental Table 3. or visit 
http://hipseq.med.harvard.edu/MedGene/publication/ 
s.Table 3.html). Among the lowest ranked 200 genes, ap- 
proximately 70% reflected true relationships. Of the 600 records 
reviewed, there were only two in which the association between 
the gene and the disease was described as negative. Both were 
genes with very low scores. In both cases, the authors did not 
argue the absence of any relationship, but rather that a 
particular feature of the gene or protein was not shown to be 
related to human prostate cancer. 1314 

The coincidence of some gene symbols with medical ab- 
breviations, chemical abbreviations and biological abbrevia- 
tions resulted in most of the false positives (see the Supporting 
Information, Supplemental Table 4, or visit http://hipse- 
q.med.harvard.edu/MedGene/publication/s.Table 4.html), em- 
phasizing the importance of the filters that were added in the 
search algorithm (Table 1). Without the filters, the false positive 
rate more than doubled, and the false negative rate rose 
dramatically (data not shown). For example, among the papers 
about breast cancer, there were only 12 Medline records that 
referred to ESRl and 10 to ESR2. whereas almost 2000 papers 
mentioned estrogen receptor without specifying ESRl or ESR2, 
this latter group was detected by the family stem term fitter. 

To further validate these results, a global analysis of the gene- 
disease relationships described by MedGene was performed. 
-For this^experimentr it was reasoned- that- the-more-closely — 
related the diseases are to one another, the more they will be 
related to the same gene sets. Thus.' if the relationships defined 
by MedGene accurately reflected the literature, then an unsu- 
pervised hierarchical clustering of the gene data should group 
diseases in a manner consistent with common medical think- 
ing. Conversely, if the clustered diseases do not make sense 
biologically or medically, it may reflect excessive false positives, 
false negatives, or inappropriate scoring of the data. 

To execute this experiment, the gene sets and the corre- 
sponding LPF values for 1000 randomly selected diseases (each 
with at least 50 gene relationships) were used as a dataset for 
clustering the diseases, A review of the results showed that the 
resulting disease clusters were indeed logical based upon 
common medical knowledge (see the Supporting Information. 
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Supplemental Figure 1, or visit http://hipseq.med.harvard.edu/ 
MedGene/publication/s.Figure l.html). For example, in one 
such cluster shown in Figure 2. diabetes and its complications 
grouped together and were also closely linked to diseases 
associated with starvation states. 

The number of genes associated with a given disease can 
be estimated by adjusting the MedGene number up by the false 
negative rate (-9%) and down by the false positive rate (~26% 
on average). Using this, the average disease has 103.7 ±45.3 
(mean ± s.d.) genes associated with it. although the range is 
quite broad with 2359 genes related to breast cancer, 2122 
genes related to lung cancer and no genes related to a number 
of diseases. 

Applying MedGene to the Analysis of Large Datasets. Access 
to a comprehensive summary of the genes linked to human 
diseases provided an opportunity to analyze data obtained from 
a high-throughput experiment. We compared the MedGene 
breast cancer gene list to a gene expression data set generated 
from a micro-array analysis comparing breast cancer and 
normal breast tissue samples. Micro-array analysis identified 
2286 genes that had greater than a 1-fold difference in mean 
expression level between breast cancer samples and normal 
breast samples. Using MedGene, we sorted the 2286 genes into 
four classes: 555 genes directly linked to breast cancer in the 
literature by gene term search (first-degree association by gene 
name); 328 genes directly linked by family term search (first- 
degree association by family term): 1021 genes linked to breast 
cancer only through other breast cancer genes (second-degree 
association); and 505 genes not previously associated with 
breast cancer. (See the Supporting Information, Supplemental 
Figure 2, or visit http://hipseq.med.harvard.edu/MedGene/ 
publication/s_Figure 2.html.) Among the 505 previously un- 
related genes, 467 were either newly identified genes or genes 
that had not previously been associated with any disease. 
Among the remaining 38 genes, 9 had been related to other 
cancers, specifically esophageal, colon, uterine, skin, and cervix. 

To determine whether the genes highlighted by the micro - 
array analysis were more likely to have been previously linked 
to breast cancer in the literature, we created a two-dimensional 
plot of the fold change of expression level between breast 
cancer and normal tissue versus the literature score (LPF) 
(Figure 3A). There was a broad spread of expression changes 
among the genes directly linked to breast cancer ranging from 
less than 1-fold change (68%) to over 40-fold (0.3%). Notably, 
the majority of genes with greater than 10-fold expression 
changes were linked to breast cancer by first-degree associa- 
tion. 

Among all 754 genes directly linked to breast cancer in the 
-literature.Uhere-was-nocorrelation_rjct 
array fold change (r = 0.018, p- value = 0.62). However, when 
we stratified the analysis based on the magnitude of the fold 
change, we observed an increasing trend in correlation (Figure 
3B) suggesting that genes with a more substantial change in 
expression level were more likely to have a stronger association 
in the literature. For genes that had 1 0-fold change or more in 
expression level, the correlation increased to 0.41 (p-value = 
0.05). 

When we evaluated the micro-array data separately for ER 
positive and ER negative tumors, the trend in correlation 
between fold change and literature score was highly dependent 
on estrogen receptor status. Interestingly, there was a similar 
trend in correlation for ER positive tumors, but no trend in 
correlation for ER negative tumors. 
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Figure 2. Global validation by clustering analysis. 2(A). The gene, sets and the corresponding LPF values for 1000 diseases, each with 
at least 50 gene relationships, were used in an unsupervised clustering of the diseases based on the gene patterns associated with 
them. A sample of the data is shown here. 2(B). One of the resulting clusters is shown that corresponds to blood sugar states. Diabetes 
terms (above the line) and starvation states terms (under the line) clustered together. Within these groups, there is also clustering of 
diabetic small vessel complications, altered serum chemistries, nutritional disorders. etc,($upplemental Figure 1: http://hipseq.med. 
harvard.edu/IVledGene/publication/s_Figure 1 .html). 



Finally, to validate our findings, we computed similar cor- 
relations between the breast cancer expression data and 
LPF scores generated by MedGene for hypertension, a 



disease unrelated to breast cancer. As expected, we did noi 
observe an increasing trend in correlation for hyperten- 
sion. 
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Figure 3. Relationship between literature score and functional data for breast cancer. 3A, The data from an expression analysis of 
samples for breast tumors and normal breast tissue were analyzed to indicate the fold difference of expression level between breast 
tumor and normal sample (cutoff > 3-fold change). The fold changes were plotted against the literature score for the same gene set. 
Green dots represent first-degree association by gene search, blue dots represent first-degree association by family search and red 
dots represent no-association. Some well-studied genes, such as BRCA2 (pink circle), are not reflected by a substaniial difference in 
expression level. Furthermore, the majority of genes that have no association with breast cancer in the literature had less than 10-fold 
expression changes (shaded area). 3B. The Spearman rank-correlation coefficients between literature score (LPF) and the fold change 
of expression level between tumor and normal breast samples (y-axis) in relation to the amount of fold change of expression level 
(x-axis). Gene rank lists were generated for breast cancer (blue) and hypertension (pink). Correlations were also computed between 
the breast cancer gene LPF scores and fold change expression data among estrogen receptor positive tumors only (light blue) and 
estrogen receptor negative tumors only (purple). 
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* MedGene results for the top 25 genes associated with breast neoplasms, hypertension, rheumatoid arthritis, bipolar disorder, and atherosclerosis, respectively, 
ranked by LPF scores, The hyperlink to all the papers co-citing the gene and the disease is available at MedCene website (http://hipseq.med.riarvard.edu/ 
Med Gene/). 



Discussion 

The Human Genome Project heralded a new era in biological 
research where the emphasis on understanding specific path- 
ways has expanded to global studies of genomic organization 
and biological systems. High-throughput technologies can 
provide novel insight into comprehensive biological function 
but also introduces new challenges. The utility of these 
technologies is limited to the ability to generate, analyze, and 
interpret large gene lists. MedGene, a relational database 
derived by mining the information in Medline, was created to 
address this need, MedGene users can query for a rank-ordered 
list of human gene-disease relationships (Table 2) for one or 
more diseases. Each entry is hyperlinked to the original papers 
supporting each association and to other relevant databases. 

MedGene is an innovative extension of previous text mining 
_approaches.„Perez-lratxeta_et al._used „the_GO annpiatioivand 
their chromosomal locations to predict genes that may con- 
tribute to inherited disorders. 8 MedGene takes a broader view 
and includes all diseases and all possible gene-disease relation- 
ships. Furthermore, MedGene utilizes co-citation to indicate a 
relationship rather than GO annotation, which is limited to the 
subset of genes that have GO annotation. Our approach is 
complementary to that taken by Chaussabel and Sher, who 
used the frequency of co-cited terms to cluster genes into a 
hierarchy of gene-gene relationships. 6 

A unique aspect of this tool is the ability to assess the relative 
strengths of gene-disease relationships based on the frequency 
of both co-citatiqn and single citation. This presupposes thai 
most co-citations describe a positive association, often referred 
to as publication bias 15 and is supported by our observations 



that negative associations are rare (Supplemental Table 3; 
http://hipseq.mcd.harvard.edu/MedGene/publication/s_Ta- 
ble 3.html). Of course, relationships established by frequency 
of co-citation do not necessarily represent a true biological link: 
however, it is strong evidence to support a true relationship. 

Another important feature of MedGene is the implementa- 
tion of software filters that substantially reduced the error rate. 
We estimate that less than 10% of all associations were missed 
and at least 70% of even the weakest associations were real. 
For this study, all of the filters that we applied were general 
ones, e.g., expanding the list of all gene names to address the 
different syntax forms used by different Journals, eliminating 
gene names that correspond to common English words, etc. 
The majority of the remaining search term ambiguities were 
idiosyncratic and difficult to identify systematically without 
causing a significant rise in false negatives. Alternative ap- 

pro ache s. such as the exam inatio n of the nearest neighbor 

terms, need to be considered to further reduce the false positive 
rate. 

It is not uncommon to see expression changes in micro- 
array experiments as small as 2-fold reported in the literature. 
Even when these expression changes are statistically significant, 
it is not always clear if they are biologically meaningful. When 
comparing expression levels of disease to normal tissue, one 
expects an enrichment of known disease -related genes to 
appear in the altered expression group. MedGene provided a 
unique opportunity to test this notion in the context of existing 
knowledge on a novel breast cancer micro-array dataset. For 
genes displaying a 5- fold change or less in tumors compared 
to normal, there was no evidence of a correlation between 
altered gene expression and a known role in the disease. This 
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Table 3. Genes with Large Exp^^ron Changes in ER- but 
Not In ER+ Breast Tumors 
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Table 3. MedGene identified a set of relatively understudied, yet highly 
expressed genes in ER negative, but not ER positive breast tumors. All of 
these genes have either never been co -cited with breast cancer or have a 
weak association except those marked with an *. 



reflects the many genes whose role in breast cancer may not 
involve large changes in expression in sporadic tumors (e.g.. 
BRCA1 and BRCAZi and genes whose modest changes in 
expression may be unrelated to the disease. Strikingly, among 
genes with a 10-fold change or more in expression level, there 
was a strong and significant correlation between expression 
level and a published role in the disease, providing the first 
global validation of the micro-array approach to identifying 
disease-specific genes. 

The results derived from MedGene have two implications. 
F irst, a caref ul hunt for corroborating evidence of a role in 
breast cancer should precede any further" studyofge nes wittT 
less than 5-fold expression level changes. Second, any genes 
with 10- fold changes or more are likely to be related to breast 
cancer and warrant attention. It is likely that this threshold will 
change depending on the disease as well as the experiment. 

Interestingly, the observed correlation was only found among 
ER-positive tumors, not ER-negative. This may reflect a bias 
in the literature to study the more prevalent type of tumor in 
the population. Furthermore, this emphasizes that caution 
must be taken when interpreting experiments that may contain 
subpopulations that behave very differently. The MedGene 
approach identified a set of relatively understudied, yet highly 
expressed genes in ER-negative tumors that are worthy of 
further examination (Table 3). 



Hu et 

In conclusion!^CTave developed an automated method of 
summarizing and organizing the vast biomedical literature. To 
our knowledge, the resulting database is the most comprehen- 
sive and accurate of its kind. By generating a score thai reflects 
the strength of the association, it provides an important tool 
for the rapid and flexible analysis of large datascts from various 
high-throughput screening experiments. Furthermore, it can 
be used for selecting subsets of genes for functional studies, 
for building disease-specific arrays, for looking at genes com- 
mon to multiple diseases and various other high-throughput 
applications. In the future, It will be possible to enhance the 
utility of the MedGene database by building links between 
genes and other MeSH terms as well as other biological 
processes and concepts, such as cell division and responses to 
small molecules. 
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Assistant Commissioner of Patents 
Washington, D.C. 20231 

Sir: 

1, Audrey D. Goddard, Ph.D. do hereby declare and say as follows: 

1 . I am a Senior Clinical Scientist at the Experimental Medicine/BioOncology, Medical 
Affairs Department of Genentech, Inc., South San Francisco, California 94080. 

2. Between 1 993 and 200 1 , 1 headed the DNA Sequencing Laboratory at the Molecular 
Biology Department of Genentech, Inc. During this time, my responsibilities included the 
identification and characterization of genes contributing to the oncogenic process, and detennination 
of the chromosomal localization of novel genes. 

3 . My scientific Curriculum Vitae, including my list of publications, is attached to and 
forms part of this Declaration (Exhibit A). 
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Serial No.: * 
Filed:* 

4. I am familiar with a variety of techniques known in the art for detecting and 
quantifying the amplification of oncogenes in cancer, including the quantitative TaqMan PCR (i.e., 
"gene amplification") assay described in the above captioned patent application. 

5. The TaqMan PCR assay is described, for example, in the following scientific 
publications: Higuchi et aL, Biotechnology 10:413-417 (1992) (Exhibit B); Livak et aL, PCR 
Methods Appl.. 4:357-3.62 (1995) (Exhibit C) and Heid et aL, Genome Res. 6:986-994 (1996) 
(Exhibit D). Briefly, the assay is based on the principle that successful PCR yields a fluorescent 
signal due to Taq DNA polymerase-mediated exonuclease digestion of a fluorescently labeled 
oligonucleotide that is homologous to a sequence between two PCR primers. The extent of 
digestion depends directly on the amount of PCR, and can be quantified accurately by measuring the 
increment in fluorescence that results from decreased energy transfer. This is an extremely sensitive 
technique, which allows detection in the exponential phase of the PCR reaction and, as a result, 
leads to accurate determination of gene copy number. 

6. The quantitative fluorescent TaqMan PCR assay has been extensively and 
successfully used to characterize genes involved in cancer development and progression. 
Amplification of protooncogenes has been studied in a variety of human tumors, and is widely 
considered as having etiological, diagnostic and prognostic significance. This use of the quantitative 
TaqMan PCR assay is exemplified by the following scientific publications: Pennica et a/., Proc. 
Natl. Acad. Sci. USA 95(25): 147 17- 14722 (1998) (Exhibit E); Pitti et aL, Nature 
396(6712);699-703 (1998) (Exhibit F) and Bieche et aL, Int. J. Cancer 78:661-666 (1998) (Exhibit 
G), the first two of which I am co-author. In particular, Pennica et aL have used the quantitative 
TaqMan PCR assay to study relative gene amplification of WISP and c-myc in various cell lines, 
colorectal tumors and normal mucosa. Pitti et aL studied the genomic amplification of a decoy 
receptor for Fas ligand in lung and colon cancer, using the quantitative TaqMan PCR assay. Bieche 
et aL used the assay to study gene amplification in breast cancer. 
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7. It is my personal experience that the quantitative TaqMan PCR technique is 
technically sensitive enough to detect at least a 2-fold increase in gene copy number relative to 
control. It is further my considered scientific opinion that an at least 2-fold increase in gene copy 
number in a tumor tissue sample relative to a normal (i.e., non-tumor) sample is significant and 
useful in that the detected increase in gene copy number in the tumor sample relative to the normal 
sample serves as a basis for using relative gene copy number as quantitated by the TaqMan PCR 
technique as a diagnostic marker for the presence or absence of tumor in a tissue sample of unknown* 
pathology. Accordingly, a gene identified as being amplified at least 2-fold by the quantitative 
TaqMan PCR assay in a tumor sample relative to a normal sample is useful as a marker for the 
diagnosis of cancer, for monitoring cancer development and/or for measuring the efficacy of cancer 
therapy. 

8. I* declare further that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true. I declare that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code, and that such willful false statements may jeopardize the validity of the application or any 
patent issuing thereoa 
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• DNA sequencing core facility supporting a 350+ person research facility. 

• DNA sequencing for high throughput gene discovery, - ESTs, cDNAs, and constructs 

• Genomic sequence analysis and gene identification 

• DNA sequence and primary protein analysis 

Research: 

• Chromosomal localization of novel genes 
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Patent Number: 6,387,657. Date of Patent: May 14, 2002. 

Goddard A, Godowski PJ and Gurney AL. Tie ligands. Patent Number: 6,372,491. Date of 
Patent: April 16, 2002. 

Godowski PJ, Gurney AL, Goddard A and Hillan K. TIE ligand homologue antibody. Patent 
Number: 6,350,450. Date of Patent: Feb. 26, 2002. 

Fong S, Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Tie 
receptor tyrosine kinase ligand homologues. Patent Number: 6,348,351. Date of Patent: 
Feb. 19, 2002. 

Goddard A, Godowski PJ and Gurney AL Ligand homologues. Patent Number: 6,348,350. 
Date of Patent: Feb. 1 9, 2002. 

Attie KM, Carlsson LMS, Gesundheit N and Goddard A. Treatment of partial growth 
hormone insensitivity syndrome. Patent Number: 6,207,640. Date of Patent: March 27, 
2001. 

Fong S, Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Nucleic 
acids encoding NL-3. Patent Number: 6,074,873. Date of Patent: June 13, 2000 

Attie K, Carlsson LMS, Gesunheit N and Goddard A. Treatment of partial growth hormone 
insensitivity syndrome. Patent Number: 5,824,642. Date of Patent: October 20, 1998 

Attie K, Carlsson LMS, Gesunheit N and Goddard A. Treatment of partial growth hormone 
insensitivity syndrome. Patent Number: 5,646,1 13. Date of Patent: July 8, 1997 
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Seshasayee D, Dowd P, Gu Q, Erickson S, Goddard AD Comparative sequence analysis of 
the HER2 locus in mouse and man. Manuscript in preparation. 

Abuzzahab MJ, Goddard A, Grigorescu F, Lautier C, Smith RJ and Chernausek SD. Human 
IGF-1 receptor mutations resulting in pre- and post-natal growth retardation. Manuscript in 
preparation. 

Aggarwal S, Xie, M-H, Foster J, Frantz G, Stinson J t Corpuz RT, Simmons L f Hillan K, 
Yansura DG, Vandlen RL, Goddard AD and Gurney AL FHFR, a novel receptor for the 
fibroblast growth factors. Manuscript submitted. 

Adams SH, Chui C, Schilbach SL, Yu XX t Goddard AD, Grimaldi JC, Lee J, Dowd P, Colman 
S., Lewin DA. (2001) BFIT, a unique acyl-CoA thioesterase induced in thermogenic brown 
adipose tissue: Cloning, organization of the human gene, and assessment of a potential link 
to obesity. Biochemical Journal 360: 1 35-1 42. 

Lee J. Ho WH. Maruoka iCl. Corpuz RT. Baldwin DT. Foster JS. Goddard AD. Yansura DG. 
Vandlen RL. Wood Wl. Gurney AL. (2001) IL-17E, a novel proinflammatory ligand for the IL- 
17 receptor homolog IL-17RM. Journal of Biological Chemistry 276(2): 1660-1664. 

Xie M-H, Aggarwal S, Ho W-H, Foster J, Zhang Z, Stinson J, Wood Wl, Goddard AD and 
Gurney AL. (2000) Interleukin (IL)-22, a novel human cytokine that signals through the 
interferon-receptor related proteins CRF2-4 and IL-22R. Journal of Biological Chemistry 275: 
31335-31339. 

Weiss GA, Watanabe CK, Zhong A, Goddard A and Sidhu SS. (2000) Rapid mapping of 
protein functional epitopes by combinatorial alanine scanning. Proc. Natl. Acad. ScL USA 97: 
8950-8954. 

Guo S, Yamaguchi Y, Schilbach S, Wada t.;Lee J, Goddard A, French D , Handa H t 
Rosenthal A. (2000) A regulator of transcriptional elongation controls vertebrate neuronal 
development. Nature 408: 366-369. 

Yan M, Wang L-C, Hymowitz SG, Schilbach S, Lee J, Goddard A, de Vos AM, Gao WQ, Dixit 
VM. (2000) Two-amino acid molecular switch in an epithelial morphogen that regulates 
binding to two distinct receptors. Science 290: 523-527. 

Sehl PD, Tai JTN, Hillan KJ, Brown LA, Goddard A, Yang R, Jin H and Lowe DG. (2000) 
Application of cDNA microarrays in determining molecular phenotype in cardiac growth, 
development, and response to injury. Circulation 101: 1990-1999. 

Guo S, Brush J, Teraoka H, Goddard A, Wilson SW, Mullins MC and Rosenthal A. (1999) 
Development of noradrenergic neurons in the zebrafish hindbrain requires BMP, FGF8, and 
the homeodomain protein soulless/Phox2A. Neuron 24: 555-566. 

Stone D, Murone, M, Luoh, S, Ye W, Armanini P, Gurney A, Phillips HS, Brush, J, Goddard 
A, de Sauvage FJ and Rosenthal A. (1999) Characterization of the human suppressor of 
fused; a negative regulator of the zinc-finger transcription factor Gli. J. Cell ScL 112: 4437- 
4448. 

Xie M-H, Holcomb I, Deuel B, Dowd P, Huang A, Vagts A, Foster J, Liang J, Brush J, Gu Q, 
Hillan K, Goddard A and Gurney, A.L. (1999) FGF-19, a novel fibroblast growth factor with 
unique specificity for FGFR4. Cytokine 11: 729-735. 
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Yan M, Lee J, Schilbach S, Goddard A and Dixit V. (1999) mE10, a novel caspase 
recruitment domain-containing proapoptotic molecule. J. Biol. Chem. 274(15): 10287-10292. 

Gurney AL, Marsters SA, Huang RM t Pitti RM, Mark DT, Baldwin DT t Gray AM, Dowd P, 
Brush J, Heldens S, Schow P, Goddard AD t Wood.WI, Baker KP, Godowski PJ and 
Ashkenazi A. (1999) Identification of a new member of the tumor necrosis factor family and its 
receptor, a human ortholog of mouse GITR. Current Biology 9(4): 215-218. 

Ridgway JBB, Ng E, Kern JA ,Lee J, Brush J, Goddard A and Carter P. (1999) Identification 
of a human anti-CD55 single-chain Fv by subtractive panning of a phage library using tumor 
and nontumor cell lines. Cancer Research 59: 271 8-2723. 

Pitti RM, Marsters SA, Lawrence DA, Roy M, Kischkel FC, Dowd P, Huang A, Donahue CJ, 
Sherwood SW, Baldwin DT, Godowski PJ, Wood Wl, Gurney AL, Hilian KJ t Cohen RL, 
Goddard AD, Botsteinp and Ashkenazi A. (1998) Genomic amplification of a decoy receptor 
for Fas ligand in lung and colon cancer. Nature 396(6712): 699-703. 

Pennica D, Swanson TA, Welsh JW, Roy MA, Lawrence DA, Lee J, Brush J, Taneyhill LA, 
Deuel B f Lew M, Watanabe C, Cohen RL, Melhem MF, Finley GG, Quirke P, Goddard AD, 
Hilian KJ, Gurney AL, Botstein D and Levine AJ. (1998) WISP genes are members of the 
connective tissue growth factor family that are up-regulated in wnt-1 -transformed cells and 
aberrantly expressed in human colon tumors. Proc. Natl. Acad. ScL USA. 95(25): 14717- 
14722. 

Yang RB, Mark MR, Gray A, Huang A, Xie MH, Zhang M, Goddard A, Wood Wl, Gurney AL 
and Godowski PJ. (1998) Toll-like receptor-2 mediates lipopolysaccharide-induced cellular 
signalling. Nature 395(6699): 284-288. 

Merchant AM, Zhu Z, Yuan JQ, Goddard A, Adams CW, Presta LG and Carter P. (1998) An 
efficient route to human bispecific IgG. Nature Biotechnology 1 6(7): 677-681 . 

Marsters SA, Sheridan JP, Pitti RM, Brush J, Goddard A and Ashkenazi A. (1 998) 
Identification of a ligand for the death-domain-containing receptor Apo3. Current Biology 8(9): 
525-528. 

Xie J, Murone M, Luoh SM, Ryan A, Gu Q, Zhang C, Bonjfas JM, Lam CW, Hynes M, 
Goddard A, Rosenthal A, Epstein EH Jr. and de Sauvage FJ. (1998) Activating Smoothened 
mutations in sporadic basal-cell carcinoma. Nature. 391(6662): 90-92. 

Marsters SA, Sheridan JP, Pitti RM, Huang A, Skubatch M, Baldwin D, Yuan J, Gurney A, 
Goddard AD, Godowski P and Ashkenazi A. (1997) A novel receptor for Apo2UTRAIL 
contains a truncated death domain. Current Biology. 7(12): 1003-1006. 

Hynes M, Stone DM, Dowd M, Pitts-Meek S, Goddard A, Gurney A and Rosenthal A. (1997) 
Control of cell pattern in the neural tube by the zinc finger transcription factor Gli-1. Neuron 
19:15-26. 

Sheridan JP, Marsters SA, Pitti RM, Gurney A., Skubatch M, Baldwin D, Ramakrishnan L, 
Gray CL, Baker K, Wood Wl, Goddard AD, Godowski P, and Ashkenazi A. (1997) Control of 
TRAIL-lnduced Apoptosis by a Family of Signaling and Decoy Receptors. Science 277 
(5327): 818-821. 
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Goddard AD, Dowd P, Chemausek S, Geffner M, Gertner J, Hintz R, Hopwood N, Kaplan S, 
Plotnick L, Rogol A, Rosenfield R, Saenger P, Mauras N, Hershkopf R, Angulo M and Attie, K. 
(1997) Partial growth hormone insensitivity: The role of growth hormone receptor mutations in 
idiopathic short stature. J. Pediatr. 131: S51-55. 

Klein RD, Sherman D, Ho WH, Stone D, Bennett GL, Moffat B, Vandlen R, Simmons L, Gu Q, 
Hongo JA, Devaux B, Poulsen K, Armanini M, Nozaki C, Asai N, Goddard A, Phillips H, 
Henderson CE, Takahashi M and Rosenthal A. (1997) A GPI-linked protein that interacts with 
Ret to form a candidate neurturin receptor. Nature. 387(6634): 717-21 . 

Stone DM, Hynes M, Armanini M, Swanson TA, Gu Q, Johnson RL. Scott MP, Pennica D, 
Goddard A, Phillips H, Noll M. Hooper JE, de Sauvage F and Rosenthal A. (1996) The 
tumour-suppressor gene patched encodes a candidate receptor for Sonic hedgehog. Nature 
334(6605): 129-34. 

Marsters SA, Sheridan JP, Donahue CJ, Pitti RM, Gray CL, Goddard AD, Bauer KD and 
Ashkenazi A. (1 996) Apo-3, a new member of the tumor necrosis factor receptor family, 
contains a death domain and activates apoptosis and NF-kappa p. Current Biology 6(1 2): 
1669-76. 

Rothe M, Xiong J, Shu HB, Williamson K, Goddard A and Goeddel DV. (1996) l-TRAF is a 
novel TRAF-interacting protein that regulates TRAF-mediated signal transduction. Proc. Natl. 
Acad. Sci. USA 93: 8241-8246. 

Yang M, Luoh SM. Goddard A, Reilly D, Henzel W and Bass S. (1996) The bglX gene 
located at 47.8 min on the Escherichia coli chromosome encodes a periplasmic beta- 
glucosidase. Microbiology 142: 1659-65. 

Goddard AD and Black DM. (1996) Familial Cancer in Molecular Endocrinology of Cancer. 
Waxman, J. Ed. Cambridge University Press, Cambridge UK, pp.1 87-21 5. 

Treanor JJS, Goodman L, de Sauvage F. Stone DM, Poulson KT, Beck CD, Gray C, Armanini 
MP, Pollocks RA, Hefti F, Phillips HS, Goddard A, Moore MW, Buj-Bello A, Davis AM, Asai N, 
Takahashi M, Vandlen R, Henderson CE and Rosenthal A. (1996) Characterization of a 
receptor for GDNF. Nature 382: 80-83. 

Klein RD, Gu Q, Goddard A and Rosenthal A. (1996) Selection for genes encoding secreted 
proteins and receptors. Proc. Natl. Acad. Sci. USA 93: 7108-71 13. 

Winslow JW, Moran P, Valverde J, Shih A, Yuan JQ, Wong SC, Tsai SP, Goddard A, Henzel 
WJ, Hefti F and Caras I. (1995) Cloning of AL-1, a ligand for an Eph-related tyrosine kinase 
receptor involved in axon bundle formation. Neuron 14: 973-981 . 

Bennett BD, ZeiglerFC, Gu Q, Fendly B, Goddard AD, Gillett N and Matthews W. (1995) 
Molecular cloning of a ligand for the EPH-related receptor protein-tyrosine kinase Htk. Proc. 
Natl. Acad. Sci. USA 92: 1866-1870. 

Huang X, Yuang J, Goddard A, Foulis A, James RF, Lernmark A, Pujol-Borrell R, 
Rabinovitch A. Somoza N and Stewart TA. (1995) Interferon expression in the pancreases of 
patients with type I diabetes. Diabetes 44: 658-664. 

Goddard AD, Yuan JQ, Fairbairn L, Dexter M, Borrow J, Kozak C and Solomon E. (1995) 
Cloning of the murine homolog of the leukemia-associated PML gene. Mammalian Genome 
6:732-737. 
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Goddard AD, Covello R, Luoh SM, Clackson T, Attie KM, Gesundheit N, Rundle AC, Wells 
JA, Carlsson LMTI and The Growth Hormone Insensitivity Study Group. (1995) Mutations of 
the growth hormone receptor in children with idiopathic short stature. N. Engl. J. Med. 333: 
1093-1098. 

Kuo SS, Moran P, Gripp J, Armanini M, Phillips HS, Goddard A and Caras IW. (1994) 
Identification and characterization of Batk, a predominantly brain-specific non-receptor protein 
tyrosine kinase related to Csk. J. Neurosci. Res. 38: 705-715. 

Mark MR, Scadden DT, Wang Z, Gu Q, Goddard A and Godowski PJ. (1994) Rse, a novel 
receptor-type tyrosine kinase with homology to Axl/Ufo, is expressed at high levels in the 
brain. Journal of Biological Chemistry 269: 1 0720-1 0728. 

Borrow J, Shipley J, Howe K, Kiely F, Goddard A, Sheer D, Srivastava A, Antony AC, 
Fioretos T, Mitelman F and Solomon E. (1994) Molecular analysis of simple variant 
translocations in acute promyelocytic leukemia. Genes Chromosomes Cancer 9: 234-243. 

Goddard AD and Solomon E. (1993) Genetics of Cancer. Adv. Hum. Genet 21: 321-376. 

Borrow J, Goddard AD, Gibbons B, Katz F, Swirsky D, Fioretos T, Dube I, Winfield DA, 
Kingston J, Hagemeijer A, Rees JKH, Lister AT and Solomon E. (1992) Diagnosis of acute 
promyelocytic leukemia by RT-PCR: Detection of PML-RARA and RARA-PML fusion 
transcripts. Br. J. Haematol. 82: 529-540. 

Goddard AD, Borrow J and Solomon E. (1992) A previously uncharacterized gene, PML, is 
fused to the retinoic acid receptor alpha gene in acute promyelocytic leukemia. Leukemia 6 
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Zhu X, Dunn JM, Goddard AD, Squire JA, Becker A, Phillips RA and Gallie BL (1992) 
Mechanisms of loss of heterozygosity in retinoblastoma. Cytogenet Cell. Genet. 59: 248-252. 
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British Med. J. 302: 409. 

Goddard AD, Borrow J, Freemont PS and Solomon E. (1991) Characterization of a novel zinc 
finger gene disrupted by the t(15;17) in acute promyelocytic leukemia. Science 254: 1371- 
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Genomics 10: 477-480. 
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We have enhanced the polymerase chain 
reaction (PGR) such that specific DNA 
sequences can be detected without open- 
ing the reaction tube. This enhancement 
requires the addition of ethidium bromide 
(EtBr) to a PGR. Since the fluorescence of 
EtBr increases in the presence of double* 
stranded (da) DNA an increase in fluores- 
cence in such a PGR indicates a positive 
amplification, which can be easily moni- 
tored externally. In fact, amplification can 
be continuously monitored in order to 
follow its progress. Tile ability to simulta- 
neously amplify specific DNA sequences 
and detect the product of the amplification 
both simplifies and improves PGR and 
may facilitate its automation and more 
widespread use in the clinic or in other 
situations requiring high sample through- 
put 



A. 



lthough the potential benefit* of rCR 1 to. clin- 
ical diagnostics arc well known, 2,5 , it i$ stiU not 
widely used in thh setting, even though it is 

_ , four year* tinea thcrrno*t*W* P^A poty™*** 

ase* 4 made PCR practical Some of the reasons for its slow 
acceptance are high cost, tack of automation of pre- and 
post-PCR processing steps, and false positive results, from 
carryovcr<ontanunatioD. The first two points arc related 
in that Ubor is the largest contributor to cost ait the present 
stage of PCR development. Most Current assays require 
some foroa of "downstream" processing once tbermocy* 
ding ifi done in order to determine whether the target 
DNA sequence was present arad has amplified, These 
include DNA hybridization 3 '*, gel ekctrophore*i& with or 
without use of restriction digestion 7 ;*, HPLC 9 , or capillary 
electrophoresis 10 . These methods are labor-intense, have 
low throughput, and are difficult to automate- The third 
point is aLo closely related to downstream processing. 
The handling of the FCH. product in these downstream 
processes increases the chances that amplified DNA will 
spread through the typing lab, resulting in a risk of 



carryover" false positives in subsequent testing , 
These downstream processing steps would be elimi- 
nated if specific amplification and detection of amplified 
DNA took place simultaneously within an unopened re- 
action vessel Assays m whkh such different processes take 
place without, the fcced to separate reaction components 
have been termed \homogeneou5*\ No truly homoge- 
neous PCR assay has been demonstrated, to date, although 
progress towards this end has been reported. Chehab, et 
aL 1 * developed a PCR product detection scheme using 
fluorescent primers that resulted in a fluorescent PCR 
product AHck-specinc primers, cadi with different fluo- 
rescent tags* were used to indicate the genotype of ftie 
DNA. However, the unincorporated primers must still be 
removed in a do wnstream process in order to visualize the 
result Recently, Holland, et al> 15 , developed an assay in 
which the endogenous 5 f <;xdOudease assay of Taq DNA 
polymerase was exploited to cleave a labeled oligonucleo- 
tkle probe. The probe would only cterve if PCR arnpliti- 
cation had produced its complementary sequence. Itj 
order to detect the cleavage products, however, a subse- 
quent process is again needed. . 

We have developed a truly homogeneous assay for PGR 
and PGR product detection based upon tbc gready in- 
creased fluorescence that ethidiura bromide and other 
DNA binding dyes exhibit when they are bound tojfc- 
DNA l4-l<J . As outhncd in Figure 1, a prototype PCR 
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FKfVRE I Principle of simultaneous amplification and detection Of 
PCR product The components of a PCR containing EtBr that are 
fluorescent are Ksted—EtBr itself, EtBr bound to other ssDN A or 
cUPNA. There is a large fluorescence cnhanccmcrit when EtBr Is 
bound to £>NA and hmding is greatly enhanced when DNA .is 
doubte-stranded. After sumcicnt <n) .cydcs of PGR* the .net 
incTcafte in dsCNA results ro additional EtEr binding, and a net 
increase in total nuoxcsccnce: 
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■ expected 
product 
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IWBS J Gel electrophoresis of PGR. amplification products of (he 
human, ttitdcar gene, HLA DQtx, made in die presence of 
increasing amounts of EtBr (up to 8 F-g/tnl). The presence of 
EtBr has no obvious effect on the yield or specificity of amplifi- 
cation. 





fiGGIf & (A) fluorescence measurement* ftx>m PCRs that contain 
0.5 ptfrnl EtBr and that are specific for Y-chrotno$ojoac repeat 
*equeticc*. Five replicate PCRj were begun containing each oftbe 
DNAs specified. At cadi indicated cycle, one of the five replicate 
PCRs for each DNA -Was removed from thcrmocyding and its 
fluorescence measured. Units of fluorescence Are arbitrary. (B> 
UV photography of PGR tubes (0,5 ml Eppcndorf -style, jxHyprO* 
pylcne m*ov<enirifuRc tubes) containing reactions, those start* 
mz from 2 njr male DNA and control reaction* without Any DMA, 
from (A). 7 
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begins with primers that are single-stranded DNA (ss- 
DNA), dNTPs, and DNA polymerase! An amount of 
dsDNA containing the target sequence (target DNA) is 
also typically present. This amount can vary, depending 
on the application, from single-cell amounts of DNA 17 to 
micrograms per PCR^ e , If EtBr is present, the reagent* 
that will fluoresce, in order of increasing fluorescence, are 
free EtBr itself, and EtBr bound to the smgk-sLraxidcd 
DNA primers and to the doublc^stranded target DNA (by 
its intercalation between the stacked bases of the DNA 
doubJc-hcfix), After the first denatu ration cyde, target 
DNA will be largely singk-stranded. After a VCR is 
completed the mosl significant change i$ the increase in 
the amount of dsDNA (the PGR product itself) of up to 
several micrograms. Formerly free EtBr is bound to the 
additional dsDNA, resulting in an increase in fluores- 
cence. There is also some decrease in the amount of 
ssDNA primer, but because the binding of EtBr to ssDNA 
is much Jess than to dsDNA, the effect of this change on 
the total fluore*ccncc of the sample is small. The fluores- 
cence increase can be measured by directing excitation 
illumination through the walls of the amplification vessel 
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before and after, or even continuously during, thennocv 
cling r 



RESULTS 

PGR in fiie presence of EtBr. in order to assess the 
affect of EtBr in PGR, amplifications of the human Hl.^\ 
DQa gene >9 were performed with the dye present at 
concentrations from 0,06 to 8.0 ugmu 1 (a typical concen- 
tration of EtBr used in staining of nucleic acids following 
get electrophoresis is 0.5 pg/mf). As shown in Figure 2, gel 
electrophoresis revealed ikde or no difference in the yield 
or quality of the amplification product whether EtBr was 
absent or present at any of these concentrations, indicat- 
ing that EtBr does not inhibit PCR, 

Detection* of human Y-dktomosoroo specific 
cpences* Sequence-specific, fluorescence enhancement of 
EtBr as a result of PGR was demonstrated in a series of 
ampiiScatiotu containing 0,5 u-g/mj EtBr and primers 
specific to repeat DNA sequences found on the human 
Y-chromosomc 20 - These PCRs initially contained cither 
6t> ng male, 60 ng female, 2 ng roak human or no DNA/ 
Five repfkatc PCRs were begun for each DNA* After Q y 
17, 21 , 24 and 29 cycles of thenuocyding, a PGR for cadi 
DNA was removed from the theiroocyder, and its fluo- 
rescence measured in a spectrofluorometer and plotted 
vs. amplification cyde number (Tie. 3 A). The shape of this 
curve reflects the fact that by the time an increase in 
fluorescence can be detected, the increase in DNA is 
becoming linear and not exponential with cyde number: 
As shown, the fluorescence increased about three-fold 
over the background fluorescence for the PCRs contain- 
ing human male DNA, but did not significantly increase 
for negative control PCR$, which contained cither no 
DNA or human female DNA. The mote male DNA 
present to begin with— 60 ng versus 2 ng— the fewer 
cycles were needed to give a detectable increase in fluo- 
rescence. Gel elearopm>resta on the products of these 
amplifications showed that DNA fragments of the ex- 
pected she were made in the male DNA containing 
reactions and that lkde DNA synthesis took place in the 
control samples. 

In addition, the increase in fluorescence was visualized 
by simply laying the completed* unopened PCRs on a UV 
transfihirninator and photographing them through a red 
Alter. This is shown in figure SB lor the reactions thaL 
began with 2 ng male DNA and those with no DNA* 

Detection of specific alleles of the human f -globin 
gene. In order to demonstrate that this approach has 
adequate specificity to allow genetic screening, a d&cciion 
of the sicl^-ccil anemia mutation was performed- Figure 
4 shows the fluorescence from completed amplications 
containing EtBr (0.5 t*gfoil) sms dertAetad by photography 
of the reaction tubes on a UV transillominator. These 
reactions were performed using primers specific for ci- 
ther the. wild-type or skkle-cell mutation of the human 
p-gtohin gene* - The specificity for each allele u imparted 
by placing the siclue-mutation site at the terminal 3' 
nucleotide of one primer. By using an appropriate primer 
annealing tetnperatiire t primer extension— and thus am- 
plifccatior*— <an take place only if the 5' nucleotide of the 
primtr i$ complementary to the 0-gtobin allele pixttfl&> 

Each jpair o* ampltfications shown in Figure 4 consists of 
a reaction with either the wifctoypc allele spedfie (left 
tube) or skkle-aUeie specific (right tube) primers. Three 
different DNAs were typed: DNA from a homozygous, 
wfld-typc frglobin individual (AA); from a heterozygous 
sickle 0~globin individual (AS); and from a homozygous 
sickle p-$obm individual (SS). Each DNA (50 ng genomic 
DNA to start each PGR) was analyzed in triplicate (3 pairs 
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c f reactions each). The DNA type vas reflected in the 
^aiivc: fluorescence intensities in each pair of completed 
amplifications. There was a significant increase in fluores- 
cence only where a (J-globin allele DNA matched the 
primer »ct. When measured or* a spcctrofltiorornetcr 
Mate not shown), this fluorescence *vas about three times 
jttt present in a PCR where both 0-globm alleles were 
^matched to the primer set. Gel ckorpphofesas (not 
ghown) established that this increase iri fluorescence 
due to the synthesis of nearly a microgram of a DNA 
fragment of the expected size for £-globin. There was 
little synthesis of dsDNA in reactions in which the allele- 
u>ccific primer was mismatched to both alleles. 

Continuous WMHWtoribog of a PCR. Using a fiber optic 
device?' u is possible to direct excitation illumination from 
n spectrofluorometer to a PCR undergoing thermocycling 
and to return its fluorescence to the qxaroftu orometer. 
The fluorescence readout of such an arrangement, di- 
rected Rt an EtEr-containing amplification of Y-chrorno- 
some speciftc sequences frojn 25 jig of rniman mate DNA* 
is shown in Figure 5. The readout from a control fCR 
whh no target DNA is also shown, Thirty cycles of PCR 
were monitored for each- 

The fluorescence trace as a function of time dearly 
shows the effect of the thermocyding. Fluorescence interc- 
ity rises and fails inversely with temperature* The fluo- 
rescence intensity is minimum at the denaturation tem- 
perature (<ri°C) and maximum at the annealin a/extension 
temperature (50°C>. In the negative-control PCR, these 
fluorescence maxima and minima do not change signifi- 
cantly over the thirty tbcrraOcycJes, indicating that there is 
little dsDNA synthesis without the appropriate target 
DNA, and there is little if any Weaehing of EtBr during 
the continuous illumination of the sample. 

In the PCR containing male DNA, the fluorescence 
maxima at the annealing/extension temperature begin to 
increase at about 4000 seconds of ther*nc<ytling ) and 
continue to increase whh time, indicating that dsDNA is 
being produced at a detectable leveL Note that the fluo- 
rescence minima at the denaturation temperature do not 
fligniftcantly increase, presumably because at thh temper- 
ature there is no ckDNA for EtBr to bind. Urus the course 
of the amplification is followed by tracking the ftuorcs-. 
cence increase at the aancaKng temperature. Analysis of 
the products of these two ampbfications by gel clccfcropho- 
renin showed a ON A fragment of the expected size for the 
male DNA containing sample and no detectable DNA 
synthe/d* for the control sampte^ 

DISCUSSION 

Downstream processes such as hybridization to a se- 
quence-Apedflc probe can enhance die specificity of DNA 
defectum hy FCR. The cHnrioa6on of iheac proccnaca 
means that* the specificity of this homogeneous assay 
depends solely on that of FCR. In the case of fickle-cell 
disease, we have shown that PCR alone has sufficient DNA 
sequence apecincky to permit genetic screening. Using 
appropriate amplification conditions, there is little non- 
specific production of dsDNA in the absence of the 
appropriate target allele. 

The specificity required to detect pathogens can be 
more or less than that required' to do genetic screening, 
depending on the number of pathogens ia the sample and 
the amount of other DNA that must be taken with the 
sample- A difficult target is HIV, which Tcqurra detection 
of a viral genome that can be at the level of a few copies 
per thousands of host cells*. Compared with generic 
screening, which is performed or cells containing at least 
one copy of the target sequence* HIV [detection requires 
both more specrfidty and the input of mere total 
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nww i UV photography of PCR tubes containing ainplificauoiis 
using EtBr that are specific to wild-type (A) or itcWe (5) alleles of 
the human £-globin gcaic. The left of each psdrof tubes contains 
aflcle-tpcdfic primer* to the wild-type alleles, the right tube 
primers to the sicWe afleie- The photograph was talwh after 30 
cycles of PCR, and the input DNAs and the alleles they contain 
*re indicated. Fifty tag of DNA was used to begin FCR Typing 
was done in triplicate (3 pairs of FC&O for each input DNA: 
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fttttras Continuous, real-time monitoring of a FCR. A fiber opt* 
was Qscd to carry excitation fight to a PCR m progress and also 
emitted light back to a flaoromctcr (see Experimental Pn^crcol). 
Amplification using human malo-DNA srcafic primers in a PCR 
starting with 20 ng of human male DNA (tofW J» * control 
PCR without DNA (bottom), were monrtorad. Thirty cydes of 
PGR were followed for each. The temperature Cycled between 
94X (denaturation) and 50*C (annealing and extensjon). Note in 
the male DNA PCR,. the cycle (time) dependent mereiisc in 
fluorescence at the anncafi^g/extenaion temperature. 
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DN A— up to microgram amounts-— ia order to have suf- 
ficient numbers of target sequences. This large amount of 
stalling DNA in an amplification staibcsritfy increases 
the background fluorescence ovtr which any additional 
fluorescence produced by PGR must be detected. An 
additional complication that occurs with targets m low 
copy-number is the formation of the **priraet-dimer" 
artifact. This is the result of the extension of one primer 
using the other primer as a template. Although this occurs 
infrequently, once it occurs the extension product is a 
substrate for PCR amplification, and can coin pete with 
true PCR targets if those targets are rare, The primer- 
dimcr product t$ of course dsDNA and thus is a potential 
source of false signal in this homogeneous a*$ay. 

To increase PGR specificity and reduce the eflfcet of 
primer-dimcr amplification, we are investigating a num- 
ber of approaches, including the use of nested-primer 
amplifications that take place in a single tube 8 , and the 
'1iot-start'\ in which nonspecific amplification fe reduced 
by raising the temperature of the reaction before DNA 
synthesis begins 25 . Preliminary resuks using these ap- 
proaches suggest tbatprraicr^iracT is effectively reduced 
and k is possible to detect the increase in EtBr fluores- 
cence in a PCR instigated by a single HIV genome in a 
background of iO 5 celts. With larger numbers of cells, the 
background fluorescence contributed by genomic DNA 
becomes problematic- To. reduce thw background, it may 
be possible to use sequence-specific DNA-binding dyes 
that can be made to preferentially bind PCR product over 
genomic DNA by incorporating the dye-bincHng DNA 
sequence into the PCR product through a 5' ''add-on" to 
the ohTOnudcotide primer 2 ' 1 . 

We nave shown that the detection of fluorescence 
generated by an EtBr-containing PCR is straightforward, 
both once PCR is completed and continuously during 
thermocyefing. The ease with which automation of spe- 
cific DNA detection can be accomplished is the most 
promising aspect of this assay. The fluorescence analysis 
of completed PCRs is alrcadyDossiblc with existing instru- 
mentation in 96-weJl format**. In this format* the fluores- 
cence in each PCR can be cjuantitated before* after, and 
even at selected points during thermocyciing by moving 
the rack of PCRs to a 9$-microwclI plate fluorescence 
reader 20 . 

Hie instrumentation necessary to continuously monitor 
multiple PCRs simultaneously is also simple in principle. 
A direct extension of the apparatus used here is to have 
multiple fiberoptics transmit the excitation light and flu- 
orescent emissions to and from multiple PCRs* The ability 
to monitor multiple PCRs continuously may allow quan- 
titation of target DNA copy number. Figure 5 shows that 
the larger the amount of starting target DNA, the sooner 
during PCR a fluorescence increase is detected. Prelimi- 
nary experiments (Wiguchi and Bollinger, manuscript in 
preparation) with continuous monitoring have shown a 
sensitivity to two-fold differences in initial target DNA 
concentration. 

Conversely, if the number of target molecules is 
known — a$ it can be in genetic saramng-rcnntinuous 
monitoring may provide a means of detecting false posi- 
tive and false negative result*. With a known number of 
target molecules, a true positive would exhibit detectable 
fluorescence by a predictable number of cycles of PCR. 
Increases in fluorescence detected before or after that 
cycle would indicate potential artifacts. False negative 
results due to, for example,. inhibition of DNA polymer- 
ase, may be detected by including within each PCR an 
inefficiently amplifying marker. This marker results in a 
fluorescence increase only after a large number of cy- 
cles — many more' than arc necessary to detect a true 




!46638 



P:5'E> 



positive. If a sample fails to have a fluorescence increase 
after this many cycles, inhibition may be suspected. Since, 
in this assay, conclusions are drawn based on the presence 
or absence of fluorescence signal alone, such controls may 
be important. In any event, before any test based on this 
principle is ready for the clinic, an assessment of its false 
positive/false negative rates wfll need to be obtained using 
a large number of known samples. 

In summary, the inclusion m PCR of dyes whose fluo- 
rescence is enhanced upon binding dsDNA makes it 
possible to detect spcci&c DNA amplification from outside 
the PCR tube. In the future, instruments based upon this 
principle may facilitate the more widespread use of PGR, 
in applications that demand the high throughput of 
samples. 

EXPERIMENTAL PROTOCOL 

Human HLA-DQa geitt *mpH£cau<ms cMtaining EtBr. 
PCRs were set up in 1 00 >U volumes confining 10 mM Tris^HCL 
pH 8.3; 50 mM KCI; 4 niM MgC^: 2.5 units of Too DNA 
polymerase (BerVm*Elmcr Gcim, NorWfc, (TV); 20 pniote cacti 
of human HtA-DQa gene specific oligonucleotide primers 
(X&S and CH27 19 and approximately 1<P topics of DQfe PCft 
product diluted from a previous reaction. EthicMum bromide 
(EtBr; Stgtna) was used at the concentrations indicated io figure 
2. Thermocyding proceeded for 20 eveles in a model 480 
thcrmocyder (Per kin-El cntr Cecu*, Norwatk, CT) uanga "stcp- 
cyclc" program of 94*C for 2 nia denature lion and WrC for "30 
sec anncilmg and 72°C for SO sec. extension. 

Y-chromoMnac specific PCR. PCRs (100 ul total reaction 
volume) containing 0.5 jifitool EtBr were prepared as described 
for HIA-DQcr, except with dhTcrent primers and target DNAs. 
These PCRs contained 1 $ pinole each male DN A-*pccu3c primes 
YIJ and Vl.2 w , and either 60 ng male, 60 og female, 2 ng mate, 
or no human DNA. Thermocyding was JK*Cfor I min- and 60?C 
for 1 min using a "rtep-Cyclc* program. The number or cycles far 
a sample were as indicated in ffgur* 3. Fluorescence measure- 
ment is described below. t 

Allek-apccific, haman £-gtobia gews PCJEL AmpUficauons of 
100 ftl volume u$tn£ 0 5 jLg/tnl of EtBr were prepared a* 
described for HLA^DQ* above except with duTerent prUncr*and 
target DNAfi. These PCRs cotilained eiU*ct\ primer pairHGP£> 
H0MA (wiW-type gWn *pecutc primers) or HCP2/HfJH$ 
le-globin specific primers) at 10 pmoJe each primer per PCR. 
These primers were developed by Wu ct alA Three different 
tacgei DNA a were ti*cd in separate a enptifkatkms— 50 ng each of 
human DNA that was homozygous for the $kV\c trait (SS). DNA 
that was heterozygous for the sickle watt (A$)» or DNA that *rft> 
homozygous for the W.t- gJobm (AA). Thermocyefing WW for 30 
cycles at 94*C for 1 min. and S5 6 C for 1 min. itsittg 3 "step^jNaf 
program. An imnealwg temperature of 55X bad licen .shown *y 
Wa tt al. 21 to provide allclc^pcdfk: aty*pIitouon- ^n^P^/J 
PCRs were photngraphed through a red filter (WratieivSaA) 
after placing the reaction twbes atop a model TM-56 transfflunu- 
nator (tiV-productS San- Gabriel, CA). 

Fhiarescc»«ce measurement. Fluorescence measurement* were 
road^ on PCRs containing EtBr in a Fluorolog-2 utiorometer 
(SPEX, Edisyn, NJ). Excitation was at the 500 nra band with 
About 2 ntn bandwidth with * GO 435 nm cutoff f£*t<jMd*cs 
Grist Inc., Irvine. CA) to exclude second-order light. Eiwiuea 
light was detected at 570 nm with a band width of about 7 nm« An 
OG 530 nm cut-off filter was used to remove the exchaoon h$M. 

ContimtooA Aoc«reftcence xatraitoring of FCR, Coi>tinvouA 
monitoring of a PCR in progress was accomplished using fflC 
spcctrofluorometcr and settings descrtbod >bovc as W«W a* 2 
nbcroptic accessory (SPEX cat. no. 1950) 10 both send cxatauon 
light to ( and reeeive emitted Ught from, a PCR pheed m a weU oi 
a model *80 uiej inocycler (Pcrkin-Elrter Cetus). The probe eflo 
of the fiberoptic cable was attached will* "5 nwMtc-cpoxy* to ttfc 
open top of a FCR cube (a 0.5 ml polypropylene ccntrifogc WW 
with its cap removed) effectively flealing ic The exposed top <n 
r,he PCR tube and the end of the fiberoptic cabte were shiewca 
from room light and the roora lights were kept dimmed dvrmg 
each run. The pioniiored PCR was an ao>p1ifieautfn of V-a? 7 ^ 
mosome-spedfic repeat scqyeiJCCS as described above, except 
using. an annealing/extension cemperamrc of 5(TC. The reaCUt^ 
was covered ivith miji^ ofl (2 drops) to prevent evapdranon- 
Tbcrnjocyding and fiuorcsccnce measurement were started si- 
multaneously, A time-base son with a 10 second integranort tune 
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uwjcl and the <smtesion signal was rarioed to the excitation 
jiigrtft] to control foe change* in li^ht-jourec intenjrity. Oata were 
^ollcd^d using the droSOOOf, version £.5 (SFEX) data system. 
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IMMUNO BIOLOGICAL LABORATORIES 



SCD-14EUSA 

Trauma, Shock and Sepsis 




The CO-14 molecule is' expressed on the surface of 
monocytes and some macrophages. Membrane- 
bound CO-14 is a receptor for ltpoiX)lysaccharide 
(LPS) complexed to LPS-Sinding-Protein (IBP). Tne 
concentration of Jts soluble form is aftered under 
certain pathological conditions. There is evidence for 
an important role of $CD-14Avith polytrauma, sepsis, 
burnings and inflammations, 
During septic conditions and acute infections ii seems 
to be a prognostic mariner and is therefore of value in 
monitoring these patients. 



JBL offers an EUSA for quantitative determination of 

soluble CD-I 4 in human serum, -piasnna, ceil-cuiture 

supematants and other biological fluids. 

Assay features: 12x8 determinations 
(microliter strips), 
precoated with a specific 
monocJona) antibody, 
2x1 hour incubation, 
standard range: 3-96 ng/rrd 
detection limit: 1 ng/mi 
CV: intra- and interassay < 8% 



for more information call or fax. 
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We nave enhanced the polymerase chain 
reaction (PGR) such that specific DNA 
sequences can be detected without open- 
ing the reaction tube. This enhancement 
requires the addition of ethidium bromide 
(EtBr) to a PGR. Since the fluorescence of 
EtBr increases in the presence of double* 
stranded (d&) DNA an increase in fluores- 
cence in such a PGR indicates a positive 
amplification, which can be easily moni» 
tored externally* In fact, amplification can 
be continuously monitored in order to 
follow its progress. Hie ability to simufta- 
tieously amplify specific DNA sequences 
and detect the product of the amplification 
both simplifies and improves PGR and 
may facilitate its automation and more 
widespread use in the clinic or in other 
situations requiring: high sample tiirough- 
put 

Although the potential benefits of PCR 1 to. clin- 
ical diagnostics arc well known*- 5 , it is still not 
widely used in this setting, even Plough it is 
four year* aiuco thcruKwtiW* DNA polymer- 
ase* 4 made PCR practical. Some of the reasons fot its slow, 
acceptance are high cost, tads, of automation of pre- and 
post-PCR processing steps, and false positive results, from 
carryoveT-contaminatioD. The first two points arc related 
in that Ubor is the largest contributor to cost at the present 
stage of PCR development. Most Current assays requite 
some form of "downstream" p«>eessing once tnermocy* 
ding is done in order to determine whether the target 
DNA sequence was present arid has amplified, These 
include DNA hybridization**, gel ekctrophorexis with or 
without use of restriction digestion*:*/ HPLC?, or capillary 
electrophoresis 10 . These methods are labor-intense, have, 
low throughput, and arc difficult to automate. The third 
point is ako closely related to downstream processing. 
The handling of the PGR product in these dowrtstrcani 
processes increases the chances that amplified DNA . wUl 
spread through the typing lab, resulting in a risk of 



carryover" false positives in subsequent testing 
These downstream processing steps would be elimi- 
nated tf specific amplification and detection of amplified 
DNA took place simultaneously within an unopened re- 
action vessel Assays m whkh such different processes take 
place without, the need to separate reaction components 
have been termed ^omogeneous"\ No truly homoge- 
neous PCR assay has been demonstrated to date, although 
progress towards this end has been reported. Chehab, et 
aL*% developed a PCR product detection scheme using 
fluorescent primers that resulted in a fluorescent PCR 
product AHcte^peeific primers, each with different fluo- 
rescent tags, were used to indicate the genotype of the 
DNA. However, the unincorporated primers must still be 
removed in a do wnstream process in order to visualize the 
result- Recently, Holland, et al> ls , developed an assay in 
which the endogenous 5 r -ex6nudease assay of Taq DNA 
polymerase was exploited to cleave a labeled oligonucleo- 
tide probe. The probe would only dcave if FCR axup$ti~ 
cation had produced its complementary sequence. In 
order to detect the cleavage products, however, a subse- 
quent process is again needed. 

Wc have developed a truly homogeneous assay for PCR 
and PCR product detection based upon tbc gready in- 
creased fluorescence that ethidium bromide and ouser 
DNA binding dyes exhibit when they are bound to ds- 
DNA 14_ie . As outHrtCO* in Figure h a prototypic PCR 
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RGfHtE I Prindptc of simultaRCOus amplification and detection of 
PCR product The component of a PCR containing EtBr diat arc 
fluorescent are listed— EiBr itself, EtBr bound to other ssDN A ot 
daDN A. There is a lar^e ftiiorescencc cnhanamicnt when EtBr is 
bound to DNA and binding; is greatly enhanced when DNA .is 
double-stranded. After suSdcnt <n)..cydcs of PGR* the net 
increase in dsDNA results in additional EtBr binding, and » net 
increase in total ftuarciccncE: 
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WCOTT % Gel electrophoresis of PCS. ana plificatbn products of the 
human, fttfdear gene, HLA DQct, made in the presence of 
increasing amounts of EtBr (up to 8 M-gtau). The presence of 
EtBr Ijas no obvious effect on ike vieM or specificity of amplifi- 
cation. 





Htmff 3 (A) Fluorescence measurement* ftx>ai PCRs that contain 
0.5 jxg/mJ EtBr and that are specific for Y<hrotno*OOTc repeat 
sequences. Five replicate PCRs were begun containing each of the 
DNAs specked. At each indicated cycle, one of the five replicate 
PCRs for each DNA *as removed from thcrmocydmg and Hs 
fluorescence measured. Units of fluorescence art arbitrary. (B) 
UV photography of PCRtube* (0,5 ml EppcndoTf-stylc, jx>typn> 
pykne mtaxxentrifuffc tubes) containing reactions, those *ea.nv 
ing from 2 ng male DNA Mid control reactions without any DMA, 
from (Ay 



begins with primers that are single-stranded DNA (ss- 
DNA), dNTTs, and DNA polymerase! An amount of 
dsDNA containing the target sequence (target DNA) is 
also typically present. This amount can vary, depending 
on the application, from single-cell amounts of DNA 17 to 
micrograms per PGR* 8 . If EtBr is present, the reagent* 
that will fluoresce, in order of increasing fluorescence, are 
free EtBr hscif, and EtBr bound to the single-stranded 
DNA primers and to the dwblc-stnmded target DNA (by 
its intercalation between the stacked bases of the DNA 
doubkr-hefix)* After the first denatu ration cyde, target 
DNA will be largely sinrie-stranded. After a PGR is 
completed, the most significant change is the increase in 
the amount of dsDNA (the PGR product itself) of up to 
several micrograms. Formerly free EtBr is bound to the 
additional dsDNA* resulting in an increase in fluores- 
cence. There is also some decrease in the amount of 
ssDNA primer, but because the binding of EtBr to ssDNA 
is much Jess than to dsDN A, the effect of this change on 
the total fluorescence of the sample is smalt The fluores- 
cence increase can be measured by directing excitation 
illumination through the walls of the amplification vessel 
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before and after, or even continuously during, thermocy. 
ding. ' 



RESULTS 

PGR in the presence of EtBr. In order to assess the 
affect of EtBr in PGR, amplifications of the human Hl-A 
DQot gene* 9 were performed with the dye present at 
concentrations from 0.06 to 8.0 pLg/ml (a typical concen- 
tration of EtBr used in staining of nuctek aods fofioiving 
get electrophoresis is 0*5 ug^E)- As shown in Figure 2> gej 
electrophoresis revealed little or no difference in the yield 
or quality of the amplification product whether EtBr was 
absent or present at any of these concentrations, indicat- 
ing that EtBr does not inhibit PGR, 

Detection of huxnan Y-chromosoimj specific 
aiiences- Sequence-specific, fluorescence enhancement of 
EtBr as a result of PGR was demonstrated in a scries of 
amplifications containing 0,5 ug/ml EtBr and primers 
specific to repeat DNA sequences found on the human 
Y-chromosomc^- These PCRs initially contained cither 
60 ng male, 60 ng female, 2 ng roak human or no DNA. 
Five replicate PCRs were begun for each DNA. After 0, 
17, 21, 24 and 29 cycles of thermocyding, a PGR for each 
DNA was removed from the thermocyder, arid its, fluo- 
rescence measured in a spectroflnorometer and plotted 
Vs. amplification cycle number (Fig. 3 A), The shape of this 
curve reflects the fact that by the time an increase in 
fluorescence can be detected, the increase in DNA is 
becoming linear and not exponential with cyde number; 
As shown, the fluorescence increased about three-fold 
over the background fluorescence for the PCRs contain- 
ing human male DNA, but did not significantly increase 
for negative control PCRs, which contained either no 
DNA or human female DNA. The more male DNA 
present to begin with— 60 ng versus 2 ng— the fewer 
cycles were needed to give a detectable increase in fluo- 
rescence. Gel eJectrophWesis on the products of these 
amplifications showed that DNA fragments of the ex- 
pected skc were made in the male, DNA containing 
reactions and that little DN A synthesis took place in the 
control samples. 

In addition, the increase in. fluorescence was visualized 
by simply laying the completed* unopened PCRs on a UV 
transillujninatOT and photographing them through a red 
filter. This is shown in figure 3B tor the reactions that 
began with 2 ng male DNA and those with no DNA- 

Detection of specific alleles of the human p-globin 
gene. In order to demonstrate that this approach has 
adequate specificity to allow genetic screening, a dttotfion 
of the sickle-cell anemia mutation was performed* Figure 
4 shows the ftuotescencc from completed amplications 

containing EtBr (0.5 Kg/ml) a* detected by photography 

of the reaction tubes on a UV transilluminator. These 
reactions were performed using primer* specific for ci- 
ther the w3d-m>e or skkk-cell mutation of the .human 
p-globm gene . The specificity for each alklc is imparted 
by placing the sickle-mutation site at the terminal 3' 
nucleotide of one primer. By using an appropriate primer 
annealing temperature, primer extension—and thus am- 
plification— can take place only if the 3' nucleotide of the 
primer i$ complementary to the $-globin allele p^c*cnt* , ■ 
Each pair Of amplifications shown in Figure 4 consists of 
a reaction with either the wild-type allele spedttc (left 
tube) or skkle-allele specific (right tube) primers. Three 
different DNAs were typed: DNA from a homozygous, 
wild-type g-globin individual (AA); from a heterozygous 
sickle flglpbin individual (AS); and from a homozygous 
sickle P-gioWn individual (SS). Each DNA (50 ng genomic 
DNA to start each PGR) was analyzed m triplicate (3 pairs 
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p f reactions each). The DNA .type \vas reflected in the 
relative: fluorescence intensities in each pair of completed 
flm plificatk>ns. There was a significant increase in fluores- 
ce* o^ly where a p-globin allele DNA matched the 
primer set. When measured oa a spcctrcflnorometcr 
[fata not shown), this fluorescence was about three times 
tjrtt present in a PCR where both ft-globm alkies were 
^matched to the primer set. Gel electrophoresis (not 
(fltowa) established that this increase in fluorescence was 
due to the synthesis of nearly a microgram of a DNA 
fragment of the expected size for £-globin. There was 
litdc synthesis of dsONA in reactions in which the allele- 
jpedfic primer was mismatched to both alleles. 

Continuous m^toriog of a PGR. Using a fiber optic 
devieerk is possible to direct excitation illumination from 
ji spectrofluorometer to a PCR undergoing thermocycliog 
and to return its fluorescence ro the RpectroftuorometCT. 
The fluorescence readout of such an arrangement, di- 
rected at an EtBr-containing amplification of Y-chromo- 
some spcci6c sequences from 25 ng of rrumsr* male DNA, 
U shown in Figure 5. The readout from a control PCR 
vim no target DNA is also shown. Thirty cycles of PGR 
were monitored for each- 

The fluorescence trace as a function of time clearly 
shows the effect of the thennowding. Fluorescence inten- 
sity riaes and fails inversely with temperature* The fluo- 
rescence intensity is minimum at the denaturation tem- 
perature (94°C) and maximum at the anneaUn^extension 
temperature (50°C). In the negative-control PCR, these 
fluorescence maxima and minima do not change signifi- 
cantly over the thirty thermocydes, indicating that there is 
little dsDNA synthesis without the appropriate target 
DNA, and there is little if any Weaehing of EtBr during 
the continuous illumination of the sample. 

Jn the PCR containing male DNA, the fluorescence 
maxima at the annealing/extension temperature begin to 
increase at about 4000 seconds" of thcrmc<ycung, and 
continue to increase whh time, indicating that dsDNA is 
being produced at a detectable level. Note that the fluo- 
rescence minima at the denaiurattou temperature do not 
significantly increase, presumably because at this temper- 
ature there is no d&DNA for EtBr to bind. Thus the course 
of the amplification is followed by tracking the fluorcs-. 
cence increase at the annealing temperature. Analysis of 
the products of these wo amplifications by gel electropho- 
resis showed a DNA fragment of the e xp ected size for the 
male DNA containing sample and no detectable DNA 
synthesis for the control sample. 

DISCUSSION 

Downstream processes such as hybridization to a se- 
quence-specific probe can, enhance tlie specificity of DNA 
deteiAiuu If> PGR. The climirwtk»«i <rf Uicac procc-roco 
means that' the specificity of this homogeneous assay 
depends solely on that of PCR. In the case of skkle-celi 
dueasc, wc have shown that PCR ak>oc has sufficient DNA 
sequence apectfichy to permit genetic screening. Using 
appropriate amplification conditions, there is little non- 
Specific production of dsDNA in the absence of the 
appropriate target allele. 

The specificity required to detect pathogens can be 
more or less than that required' to do genetic screening, 
depending on the number of pathogens iix the sample and 
the amount of other DNA that must be taken with the 
sample. A difficult target is HIV, which requires detection 
of a viraJ genome that can be at the level of a few copies 
per thousands of host cells 6 . Cotnoared with genetic 
screening, which is performed on ceils ^containing at least 
one copy of the target sequence* HIV "detection requires 
both more specificity and the inpxu of mote total 



Homozygous 

AA 



Heterozygous 

AS 



Homozygous 

ss 



<* UV photography of PCR tubes containing ampEficaiioiis 
using EtBr that are specific to wild-type (A) or lic&te (S> alldcs of 
the human 0-gLobin gene. The left of each pair of tubes contains 
aBele-spccifie primer* to the wild-type slides, the right tube 
primers to the sickle aJttek- The photograph was taken after SO 
cycles of PCR, awl the input DNAs and the alkies they contain 
are indicated. FSfty lag of DNA was used to begjn PGR. Typing 
was done in triplicate (3 pairs of FCfcs) for each input DNA: 
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HGtfBI S Continuous, real-time monitoring of a PCR. A fiber optk 
was used to carry excitation fight to a rtiR in progress aad also 
emitted light back to a flooro meter (sec Experimental Protocol). 
Anrptificadon USiog human malo-DNA specific prnscrs in a PCR 
Starting with 20 ng of hwuvan male DNA (top* °r in * control 
PCR without DNA bottom), were roonhoTcd. Thirty cydes of 
FCR were foJtowed for each. The temperature cycled between 
94*C (denaturation) and 50*0 (annealing and extension}. Note in 
the male DNA PCR,. the cycle (time) <ftcptt&CQt increase in 
fluorescence at the aoiie*Btt#extcnsion temperature. 
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DNA— up to microgram aniount»™in order to have suf- 
ficient numbers of target sequences. This large amount of 
starting DNA in an amplification sistafcmtly increases 
the background fluorescence over wnicb any additional 
fluorescence produced by PGR must be detected. An 
additional complication that occurs with targets m low 
copy-number is the formadon of the "primer-dimer" 
artifact. This is the result of the extension of one primer 
using the other primer a template. Although this occurs 
infrequently, once it occurs me extension product is a 
substrate for PCR amplification, and can compete with 
true PCR targets if those targets are rare, T ne primex- 
dimcr product i$ of course dsDNA and thu s i* a potential 
source of false signal in this homogeneous assay. 

To increase FOR specificity and reduce the efTcct of 
primer-dimcT antplification, we are investigating a num- 
ber of approaches, including the use of ncstedrpruner 
amplifications that take place in a single tube 8 , and the 
"hot-start", in which nonspecific amplification is reduced 
by raising the temperature of the reaction before DNA 
synthesis begins* 5 . Preliminary results using these ap- 
proaches suggest tbatpruncr-dimcT is effectively reduced 
and it is possible to detect the increase in EtBr fluores- 
cence in a PCR instigated by a single HIV genome in a 
background of 10* cells. With larger numbers of cells, the 
background fluorescence contributed by genomic DNA 
becomes problematic. To reduce this background, it may 
be possible to use sequence-specific DNA»binding dyes 
that can be made to preferentially bind PCR product over 
genomic DNA by incorporating the dye-binding DNA 
sequence into the PCR product through a 5' "add-on" to , 
the oUgonuclcotide primer 24 . 

We have shown that the detection of fluorescence 
generated by an EtBr-containing PCR is straightforward, 
both once PCR is completed and continuously during 
thertnocyefing. The ease with which automation of spe- 
cific DNA detection can be accomplished is the most 
promising aspect of this assay. The Huorescence analysis 
of completed PCRs is alrcadyjfiossiblc with existing instru- 
mentation in 96-weJl format*** In this format, the fluores- 
cence in each PCR can be cjuantitated before, after* and 
even at selected points during therraocyciirtg by moving 
the tack of PCRs to a 9u-mierowc!l plate fluorescence 
reader 20 . , 

The instrumentation necessary to continuously monitor 
multiple PCRs simultaneously is also simple in principle. 
A direct extension of the apparatus used here is to have 
multiple fiberopdes transmit the excitation light and flu- 
orescent emissions to and from multiple PCRs. The ability 
to monitor multiple PCRs continuously may allow quan- 
titation of target DNA copy number. Figure 5 shows that 
the larger the amount of starting target DNA, the sooner 
durittg PCR a fluorescence increase is detected. Prelimi- 
nary experiments <Wiguchi and DoHinger, manuscript in 
preparation) with continuous monitoring have shown a 
sensitivity to two»fold differences in initial target DNA 
concentration. 

Conversely, if the number of target molecules is 
known — as it can be in genetic scrcenjng-r^ntinwous 
monitoring may provide a means of detecting false posi- 
tive and false negative results. With a known number of 
target molecules, a true positive would exhibit detectable 
fluorescence by a predictable number of cycte of PCR. 
Increases in fluorescence detected before or after that 
cycle would indicate potential artifacts. False negative 
results due to, for example, inhibition of DNA polymcr> 
ase, may be detected by including within each PCR an 
inefficiently amplifying marker. This marker tcsuIcs in a 
fluorescence increase only after a large number of cy- 
cles — many more' than arc necessary to detect a true 



B!Q/IK>NaOGY VOL 10 APRL1092 



positive. If a sample fails to have a fluorescence increase 
alter this many cycles, inhibition may be suspected. Since, 
«> this assay, conclusions are drawn based on the presence 
or absence of fluorescence signal alone, such controls may 
be important. In any event Before any test based on this 
principle is ready for the clinic, an assessment of its false ; 
positive/false negative rates will need to be obtained using 
a large number of known samples. 

In summary, the inclusion in PCR of dyes whose fluo- 
rescence is enhanced upon binding dsDNA makes it 
possible to detect spcci&c DNA amplification from outside 
the PCR tube. In the future, instruments based upon this 
principle may facilitate the more widespread use of PCR 
in applications that demand die high throughput of 
samples. 

EXPERIMENTAL PROTOCOL 

Human HLA-DQn geae ^pWkadons cmUaining EtBr, 
PCRs were set up itt 100 j*l volumes containing 10 mM Tris-HCl, 
pH 8.3; 50 mM KC1; 4 roM MgC^: t£ unit* of Toj DNA 
polymerwe (Per1ti»«.E}mcr Genu, Norwalk* CT); 20 pinole each 
of human HUV-DQa gene specific oligonucleotide primers 
(mlb and CH27 19 and approximately NT copies of DQfe PCR 
product diluted fri>m a previous faction. Ethidium bromide 
(Et&n StgttiA) was used at the concentrations indicated in Figure 
2. TheTmocyding proceeded for 20 cvdes tn a model 480 
dwrmocydcr (Perkiti-EImer Cccu*, Norwalk, CT) u«ng a "step- 
cycle*' program of 94*G for 1 min. dcnaturalion and 60°C rot* w 
sec an raffing and 7VC for 30 irc. e*ten$to». 

Y-chrontosomc specific PCR. PCRs (100 jd. total reaction 
volume) containing v£ ptg/ral EtBr were prepared as described 
for HLA-DQcr, except wiSn difTercnt ririmers and target I)NAs. 
These PCRs contained 1 5 pmolc each male DN A-spccifjc primers 
VI . ) and V I.2 M , and chhcr 60 ng male, 00 Og female, 2 ng mak, 
or no human 1>NA. TherroOcycling W*$94?G Tor 1 min, and 60?C 
for 1 min using a "step-cyde* program. The number of cycles for 
a sample were as indicated itt Figure 3. Fluorescence measure- 
ment « described below. t 

Allck^spccbic, human p-gtofrin gw* PCR, AmpUficauons of 
100 v-\ vpiumc ustnff 0 5 fifi/ml of £tBr were prepared as 
described for HUUDQa above except with different prm>cr» and 
target DNAs. These PCRs cOtiiained eilher. primer pair HGP2' 
UpHA (wfaHvpe gWn speci&c primers) or HGF&HpMS <sick- 
lc-giobin spedBc primers) at 10 pmole each primer per PCR, 
Tnese primers were developed by Wu ct aL 2 \ Three different 
tatg& DNA* were t«c<l in separate amplifications— 50 ng each of 
human UNA that was homozygous for the «*ckk trait ($5)* DNA 
that wa£ heterozygous for the sickle wak <A$)> or DNA that was 
homozygous for die w.l- ^Jobin (AA). ThcTmocycHng was fox 3D 
cycles at 94°C for 1 mm, and 55^ for 1 min. itsinj a "stcp-cydft 
program. An anneaHng ttmpcrature of 55^0 h*d heen shown ny 
VVu ct al 2J 10 provide alldc^pcrifk awpliheation. vomplcted 
PCRs were pheSngraphed through a red ftltcr (Wraticn _z3A) 
after placing the reacSon tubes atop a model TM-S6 transUlumi- 
natox (OV-productS Sah-Gabricl, CA>. 

nitorescertee meaanranent. Fliiorcsoeoce ro<^rcTOcn ^ were 
mad* oh PCRs containing EtBr in a Fluorolog-2 fltioromCtcr 
(SPEX Edison, NJ). ^citation was at the 500 nra band witi* 
ahour 2 ntn bandwidth with a GO 435 nm cutoff ^r<McMc S 
Grist Inc. Irvine. CA) to exclude scccti^^rder ^t. Ewritcd 
hght was detected at 5?0 nm with a bandwidth of about 7 nm. An 
OG 530 nm cut-nff Biter Was used to remove the excitation hpM 

ContitHtouft ftaorescenee momtormg of PCR. Coimnuou* 
monitoring of a PCR in progress was accomplished using tnC 
spcctrofluoromeicr and setdnga descrtbod above as mu as a 
fiberoptic accessory (SPJ&X cat no. 1950) to both send cxatatjon. 
fight to, and receive emitted light from, a PCR pfceed m a weU oj 
a model 430 ihermocydcr (Perkm-Elmer Cetus). The prohc ena 
of the fiberoptic cable was attached viih U S mm utc-cpoxy" to ihc 
open top of a PCR tube (a 0.5 ml p^ypropyiene centrifuge WW 
with its cap removed) effectively wealing ic The exposed top 
the PCR tube and the end of the fiberoptic caWe were shjcldco 
from room light and the rooco Ijghu were kept dirraned dumjS 
each run. The monixored FOR was an ampTxficaudn of Y-tfjro- 
mosdme-spedfk: repeat scqweftCCS as described above, except 
usinff.an anncah'ng/extension temperature of 50X. The reaCtJon 
was covered with mij*tr*l oil (i drops) to prevent evaporation- 
ThcrnjQCydmg and fluoresccrJCC measurement were started si- 
multaneously, A ume-basc scan with a 10 second mtegrarjOt> tunc 
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uwsd and the emtaciou signal was raiioed to the. excitation 
aign.nl to control for ctamftts in light-source intensity, ffcta were 
^Sleeted using the dra3000f t version S,5 (SFEX) data system. 

\Vc <tan& Bob Jouca for help with *he spectrofluormcrric 
(nrtfutffct»<MilJ and >!calherbe!> Fonjrfor editing this manuscript. 
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Oligonucleotides with Fluorescent Dyes at 
Opposite Ends Provide a Quenched Probe 
System Useful for Detecting PCR Product 
and Nucleic Acid Hybridization 

Kenneth J. Livak, Susan J.A. Flood, Jeffrey Marmaro, William Giusti, and Karin Deetz 

Perkin-Elmer, Applied Biosystems Division, Foster City, California 94404 



The 5' nuclease PCR assay detects the 
accumulation of specific PCR product 
by hybridization and cleavage of a 
double-labeled fluorogenk probe 
during the amplification reaction. 
The probe Is an oligonucleotide with 
both a reporter fluorescent dye and a 
quencher dye attached. An Increase 
In reporter fluorescence Intensity In- 
dicates that the probe has hybridized 
to the target PCR product and has 
been cleaved by the 5' nucle- 
olytlc activity of Taq DNA polymerase. 
In this study, probes with the 
quencher dye attached to an Internal 
nucleotide were compared with 
probes with the quencher dye at* 
tached to the 3'-end nucleotide. In all 
cases, the reporter dye was attached 
to the 5' end. All Intact probes 
showed quenching of the reporter 
fluorescence* In general, probes with 
the quencher dye attached to the 3'- 
end nucleotide exhibited a larger sig- 
nal in the 5' nuclease PCR assay than 
the Internally labeled probes. It Is 
proposed that the larger signal Is 
caused by Increased likelihood of 
cleavage by Taq DNA polymerase 
when the probe Is hybridized to a 
template strand during PCR* Probes 
with the quencher dye attached to 
the 3 -end nucleotide also exhibited 
an Increase In reporter fluorescence 
intensity when hybridized to a com- 
plementary strand. Thus, oligonucle- 
otides with reporter and quencher 
dyes attached at opposite ends can 
be used as homogeneous hybridiza- 
tion probes. 



homogeneous assay for detecting 
the accumulation of specific PCR prod- 
uct that uses a double-labeled fluoro- 
genic probe was described by Lee et al. (t) 
The assay exploits the 5 f -+ 3' nucle- 
olytic activity of Taq DNA poly- 
merase^ and is diagramed in Figure 1. 
The fluorogenic probe consists of an oli- 
gonucleotide with a reporter fluorescent 
dye, such as a fluorescein, attached to 
the 5* end; and a quencher dye, such as a 
rhodamine, attached internally. When 
the fluorescein is excited by irradiation, 
its fluorescent emission will be 
quenched if the rhodamine is close 
enough to be excited through the pro- 
cess of fluorescence energy transfer 
(FET).< 4 - 5) During PCR, if the probe is hy- 
bridized to a template strand, Taq DNA 
polymerase will cleave the probe be- 
cause of its inherent 5' -» 3' nucleotytlc 
activity. If the cleavage occurs between 
the fluorescein and rhodamine dyes, it 
causes an increase in fluorescein fluores- 
cence intensity because the fluorescein 
is no longer quenched. The increase in 
fluorescein fluorescence intensity indi- 
cates that the probe-specific PCR product 
has been generated. Thus, FET between a 
reporter dye and a quencher dye is criti- 
cal to the performance of the probe in 
the 5' nuclease PCR assay. 

Quenching is completely dependent 
on the physical proximity of the two 
dyes. <6) Because of this, it has been as- 
sumed that the quencher dye must be 
attached near the 5' end. Surprisingly, 
we have found that attaching a rho- 
damine dye at the 3' end of a probe 
still provides adequate quenching for 
the probe to perform in the 5' nuclease 



PCR assay. Furthermore, cleavage of this 
type of probe is not required to achieve 
some reduction in quenching. Oligonu- 
cleotides with a reporter dye on the 5' 
end and a quencher dye on the 3' end 
exhibit a much higher reporter fluores- 
cence when double-stranded as com- 
pared with single-stranded. This should 
make it possible to use this type of dou- 
ble-labeled probe for homogeneous de- 
tection of nucleic acid hybridization. 

MATERIALS AND METHODS 
Oligonucleotides 

Table 1 shows the nucleotide sequence 
of the oligonucleotides used in this 
study. Linker arm nucleotide (LAN) 
phosphoramidite was obtained from 
Glen Research. The standard DNA phos- 
phoramidites, 6-carboxyfluorescein (6- 
FAM) phosphoramidite, 6-carboxytet- 
rarnethylrhodamine succinimidyl ester 
(TAMRA NHS ester), and Phosphalink 
for attaching a 3'-blocking phosphate, 
were obtained from Perkin-Elmer, Ap- 
plied Biosystems Division. Oligonucle- 
otide synthesis was performed using an 
ABI model 394 DNA synthesizer (Applied 
Biosystems). Primer and complement 
oligonucleotides were purified using 
Oligo Purification Cartridges (Applied 
Biosystems). Double-labeled probes were 
synthesized with 6-FAM-labeIed phos- 
phoramidite at the 5' end, LAN replacing 
one of the T's in the sequence, and Phos- 
phalink at the 3' end. Following de- 
protection and ethanol precipitation, 
TAMRA NHS ester was coupled to the 
LAN-containing oligonucleotide in 250 
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FIGURE 1 Diagram of S' nuclease assay. Stepwise representation of the 5' - 3' 

5S " «5 DNApolymerase acting on a fluorogenic probe during one extension phase of POL 



rriM Na-bicarbonate buffer (pH 9.0) at 
room temperature. Unreacted dye was 
removed by passage over a PD-10 Sepha- 
dex column. Finally, the double-labeled 
probe was purified by preparative high- 
performance liquid chromatography 
(HPLC) using an Aquapore C 8 220x 4.6- 
mm column with 7-fun particle size. The 
column was developed with a 24-min 
linear gradient of 8-20% acetonitrile in 
0.1 m. TEAA (triethylamine acetate). 
Probes are named by designating the se- 
quence from Table 1 and the position of 
the LAN-TAMRA moiety. For example, 
probe Al-7 has sequence Al with LAN- 
TAMRA at nucleotide position 7 from the 
5' end. 



PCR Systems 

All PCR amplifications were performed 
in the Perkin-Elmer GeneAmp PCR Sys- 
tem 9600 using 50-uJ reactions that con- 
tained 10 mM Trls-HCl (pH 83), 50 mM 
KG, 200 pM dATP, 200 uw dCTP, 200 pM 
dGTP r 400 |tM dUTP, 0.5 unit of AmpEr- 
ase uracil N-glycosylase (Perkin-Elmer), 
and 1.25 unit of AmpliTaq DNA poly- 
merase (Perkin-Elmer). A 295-bp seg- 
ment from exon 3 of the human (i-actin 



gene (nucleotides 2141-2435 in the se- 
quence of Nakajima-1l|ima et aL) (7) was 
amplified using primers AFP and ARP 
(Table 1), which are modified slightly 
from those of du Breuil et al (8> Actin am- 
plification reactions contained 4 mM 
MgCl 2 , 20 ng of human genomic DNA, 
50 nM Al or A3 probe, and 300 nM each 



TABLE 1 Sequences of Oligonucleotides 



primer. The thermal regimen was 50°C 
(2 min), 95°C (10 min), 40 cycles of 95°C 
(20 sec), 60°C (1 min), and hold at 72°C. 
A S15-bp segment was amplified from a 
plasmid that consists of a segment of X 
DNA (nucleotides 32,220-32,747) in- 
serted in the Smal site of vector pUC119. 
These reactions contained 3.5 mM 
MgCl 2 , 1 ng of plasmid DNA, 50 nM P2 or 
P5 probe, 200 nM primer F119, and 200 
nM primer R119. The thermal regimen 
was 50°C (2 min), 95°C (10 min), 25 cy- 
cles of 95°C (20 sec), 57°C (I min), and 
hold at 72°C 



Fluorescence Detection 

For each amplification reaction, a 40-uJ 
aliquot of a sample was transferred to an 
individual well of a white, 96-well rnlcro- 
titer plate (Perkin-Elmer). Fluorescence 
was measured on the Perkin-Elmer Taq- 
Man LS-50B System, which consists of a 
luminescence spectrometer with plate 
reader assembly, a 485-nm excitation ni- 
ter, and a 515-nm emission filter. Excita- 
tion was at 488 nm using a 5-nm slit 
width. Emission was measured at 518 
nm for 6-FAM (the reporter or R value) 
and 582 nm for TAMRA (the quencher or 
Q. value) using a 10-nm slit width. To 
determine the increase in reporter emis- 
sion that is caused by cleavage of the 
probe during PCR, three normalizations 
are applied to the raw emission data. 
First, emission intensity of a buffer blank 
is subtracted for each wavelength. Sec- 
ond, emission intensity of the reporter is 



Name 


Type 


F119 


primer 


R119 


primer 


P2 


probe 


P2C 


complement 


P5 


probe 


P5C 


complement 


AFP 


primer 


ARP 


primer 


Al 


probe 


A1C 


complement 


A3 


probe 


A3C 


complement 



Sequence 



ACCCACAGGAACTGATCACCACTC 
ATGTCGCGTrCCGGCTGACGTTCTGC 
TCGCATTACI GATCGTi'GCCAACCAGTp 
GTACTGGTTGGCAACGATCAGTAATGCGATG 

CGGA'lTTGCTGGTATCrATGACAAGGATp 
TTCATCCTTGTCATAGATACCAGCAAATCCG 

TCACCCACACTGTGCCCATCTACGA 
CAGCGGAACCGC1X2ATTGCCAATGG 
ATGCCCTCCCCCATGCCATCCTGCGlp 
AGACGCAGGATGGCATGGGGGAGGCCATAC 

CGCCCTGGACTTCGAGCAAGAGATp 
CCATCTCTTGCrCGAAGTCCAGGGCGAC 



For each oligonucleotide used in this stuay, me miuw, ~ ° «mbeused 

T-Ty direcuon. There are three types of oligonucleotides: PCR primer, 

to the 5' nuclease assay, and complement used to ^f«^ 

probes, the underlined base indicates a position where UN with TAMRA attachea wa 

tuted for a T. (p) The presence of a 3' phosphate on each probe. 
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A1-2 RAQGCCCTCCCCCATGCCATCCTGCGTp 

A1-7 RATGCCCQC'CCCCATGCCATCCTGCGTp 

A1 -1 4 RATGCCCTCCCCCAQGCCATCCTGCGTp 

A1-19 RatgccctcccccatgccaQcctgcgtp 

A1-22 RATGCCCTCCCCCATGCCATCCQGOGTp 

A1 -26 RatgccctcccccatgccatcctgcgQp 



Probe 


518 nm 


582 


nm 


RQ- 


RQ+ 


ARQ 




no temp. 


+ temp. 


no temp. 


+ temp. 








A1-2 


25.5 i 2.1 


32.7 ±1.9 


38.2 ± 3.0 


38.2 ±2.0 


0.67 + 0.01 


0.66 ±0.06 


0.19+0.06 


A1-7 


53.5 + 4.3 


395.1 ±21.4 


108.5 ±63 


110.3 ±5.3 


0.49 ±0.03 


3.58 ±0.17 


3.09 ±0.18 


A1-14 


127.0 ±4.9 


403.5 ±19.1 


109.7 ±5.3 


93.1 ±6.3 


1.16 ±0.02 


4.34 ±0.15 


S.18±0.15 


A1-19 


187.5 ±17.9 


422.7 ±7.7 


70.3 ±7.4 


73.0 ±2.8 


267 ±005 


5^0 ±0.15 


3.13 + 0.16 


A1-22 


224.6 ±9.4 


482.2 ± 43.6 


100.0 ±4.0 


96.219.6 


2,25 ±0.03 


5.02 ±0.11 


2.77 ±0.12 


A1-26 


160.2 ±8.9 


454.1 ± 16.4 


93.1 ±5.4 


90.7 ±3.2 


1.72 ±0.02 


5.01 ±0.08 


3.29 ±0.08 



FIGURE 2 Results of 5' nuclease assay comparing p-actin probes with TAMRA at different nude- 
otide positions. As described in Materials and Methods, PCR amplifications containing the in- 
dicated probes were performed, and the fluorescence emission was measured at 518 and 582 nm. 
Reported values are the average ±1 s.D. for six reactions run without added template (no temp.) 
and six reactions run with template (+temp.). The RQ ratio was calculated for each individual 
reaction and averaged to give the reported RQ~ and RQ + values. 



divided by the emission intensity of the 
quencher to give an RQ ratio for each 
reaction tube. This normalizes for well- 
to-well variations in probe concentra- 
tion and fluorescence measurement. Fi- 
nally, ARQ is calculated by subtracting 
the RQ value of the no-template control 
(RQ~) from the RQ value for the com- 
plete reaction including template 
(RCT). 

RESULTS 

A series of probes with increasing dis- 
tances between the fluorescein reporter 
and rhodamine quencher were tested to 
investigate the minimum and maximum 
spacing that would give an acceptable 
performance in the 5 f nuclease PCR as- 
say. These probes hybridize to a target 



sequence in the human 0-actin gene. 
Figure 2 shows the results of an experi- 
ment in which these probes were in- 
cluded in PCR that amplified a segment 
of the p-actin gene containing the target 
sequence. Performance in the S' nu- 
clease PCR assay is monitored by the 
magnitude of ARQ, which is a measure 
of the increase in reporter fluorescence 
caused by PCR amplification of the 
probe target. Probe Al-2 has a ARQ value 
that is close to zero, indicating that the 
probe was not cleaved appreciably dur- 
ing the amplification reaction. This sug- 
gests that with the quencher dye on the 
second nucleotide from the 5' end, there 
is insufficient room for Taq polymerase 
to cleave efficiently between the reporter 
and quencher. The other five probes ex- 
hibited comparable ARQ values that are 



clearly different from zero. Thus, all five 
probes are being cleaved during PCR am- 
plification resulting in a similar increase 
in reporter fluorescence. It should be 
noted that complete digestion of a probe 
produces a much larger Increase in re- 
porter fluorescence than that observed 
in Figure 2 (data not shown). Thus, even 
in reactions where amplification occurs, 
the majority of probe molecules remain 
uncleaved. It is mainly for this reason 
that the fluorescence intensity of the 
quencher dye TAMRA changes little with 
amplification of the target. This is what 
allows us to use the 582-nm fluorescence 
reading as a normalization factor. 

The magnitude of RQ" depends 
mainly on the quenching efficiency in- 
herent in the specific structure of the 
probe and the purity of the oligonucle- 
otide. Thus, the larger RQ" values indi- 
cate that probes AM4, Al-19, Al-22, and 
At -26 probably have reduced quenching 
as compared with Al-7. Still, the degree 
of quenching is sufficient to detect a 
highly significant increase in reporter 
fluorescence when each of these probes 
is cleaved during PCR. 

To further investigate the ability of 
TAMRA on the 3' end to quench 6-FAM 
on the 5' end, three additional pairs of 
probes were tested in the 5' nuclease 
PCR assay. For each pair, one probe has 
TAMRA attached to an internal nucle- 
otide and the other has TAMRA attached 
to the 3* end nucleotide- The results are 
shown in Table 2. For all three sets, the 
probe with the 3' quencher exhibits a 
ARQ value that is considerably higher 
than for the probe with the internal 
quencher. The RQ - values suggest that 
differences in quenching are not as great 
as those observed with some of the Al 
probes. These results demonstrate that a 
quencher dye on the 3' end of an oligo- 
nucleotide can quench efficiently the 



TABLE 2 Results of 5 f Nuclease Assay Comparing Probes with TAMRA Attached to an Internal or 3'-terminal Nucleotide 



518 nm 



582 nm 



Probe 


no temp. 


+ temp. 


no temp. 


+ temp. 


RQ" 


RQ + 


ARQ 


A3-6 


54.6 ± 3.2 


84.8 ± 3.7 


116.2 ± 6.4 


115.6 ±2.5 


0.47 ± 0.02 


0.73 ± 0.03 


0.26 ± 0.04 


A3-24 


. 72.1 ± 2.9 


236.5 ± 11.1 


84.2 ± 4,0 


90.2 ± 3.8 


0.86 ± 0.02 


2.62 ± 0.05 


1.76 ± 0.05 


P2-7 


82.8 ± 4.4 


384.0 ±34.1 


105.1 ± 6.4 


120.4 ± 10.2 


0.79 ± 0.02 


3.19 + 0.16 


2.40 ±0.16 


P2-27 


113.4 ±6.6 


555.4 ± 14.1 


140.7 ± 8.5 


118.7 ± 4.8 


0.81 ± 0.01 


4.68 ± 0.10 


3.88 ± 0.10 


P5-10 


77,5 ± 6.5 


244.4 ± 15 9 


86.7 ± 4.3 


95.8 ± 6.7 


0.89 ± 0.05 


2.55 ± 0.06 


1.66 ± 0.08 


P5-28 


64.0 ± 5.2 


333.6 ±12.1 


100,6 ± 6.1 


94.7 ± 6.3 


0.63 ± 0.02 


3,53 ± 0.12 


2.89 ± 0.13 



Reactions containing the indicated probes and calculations were performed as described in Material and Methods and in the legend to Fig. 2. 
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fluorescence of a reporter dye on the 5' 
end. The degree of quenching is suffi- 
cient for this type of oligonucleotide to 
be used as a probe in the 5 r nuclease PCR 
assay. 

To test the hypothesis that quenching 
by a 3' TAMRA depends on the flexibility 
of the oligonucleotide, fluorescence was 
measured for probes in the single- 
stranded and double-stranded states. Ta- 
ble 3 reports the fluorescence observed 
at 518 and 582 nm. The relative degree 
of quenching is assessed by calculating 
the RQ ratio. For probes with TAMRA 
6-10 nucleotides from the 5' end, there 
is little difference in the RQ values when 
comparing single-stranded with double- 
stranded oligonucleotides. The results 
for pTobes with TAMRA at the 3' end are 
much different. For these probes, hy- 
bridization to a complementary strand 
causes a dramatic increase in RQ. We 
propose that this loss of quenching is 
caused by the rigid structure of double- 
stranded DNA, which prevents the 5' 
and 3' ends from being in proximity. 

When TAMRA is placed toward the 3' 
end, there is a marked Mg 24 * effect on 
quenching. Figure 3 shows a plot of ob- 
served RQ values for the Al series of 
probes as a function of Mg 2 "*" concentra- 
tion. With TAMRA attached near the 5' 
end (probe Al-2 or Al-7), the RQ value at 
0 mM Mg 2 "" is only slightly higher than 
RQ at 10 mM Mg 2 *. For probes Al-19, 
Al-22, and Al-26, the RQ values at 0 mM 
Mg 2 * are very high, indicating a much 



reduced quenching efficiency. For each 
of these probes, there is a marked de- 
crease in RQ at 1 mM Mg 2 "" followed by 
a gradual decline as the Mg 2+ concen- 
tration Increases to 10 mM. Probe Al-14 
shows an intermediate RQ value at 0 nut 
Mg 2 * with a gradual decline at higher 
Mg 2 * concentrations. In a low-salt en- 
vironment with no Mg 2+ present, a sin- 
gle-stranded oligonucleotide would be 
expected to adopt an extended confor- 
mation because of electrostatic repul- 
sion. The binding of Mg 24 ions acts to 
shield the negative charge of the phos- 
phate backbone so that the oligonucle- 
otide can adopt conformations where 
the 3' end is close to the 5' end. There- 
fore, the observed Mg 2 * effects support 
the notion that quenching of a 5' re- 
porter dye by TAMRA at or near the 3' 
end depends on the flexibility of the oli- 
gonucleotide. 

DISCUSSION 

The striking finding of this study is that 
it seems the rhodamine dye TAMRA, 
placed at any position in an oligonucle- 
otide, can quench the fluorescent emis- 
sion of a fluorescein (6-FAM) placed at 
the 5' end. This implies that a single- 
stranded, double-labeled oligonucle- 
otide must be able to adopt conforma- 
tions where the TAMRA is close to the 5' 
end. It should be noted that the decay of 
6-FAM in the excited state requires a cer- 
tain amount of time. Therefore, what 



TABLE 3 Comparison of Fluorescence Emissions of Single-stranded and 
Double-stranded Fluorogenic Probes 



518 nm 



582 nm 



RQ 



Probe 



ss 



ds 



ds 



ss 



ds 



Al-7 

Al-26 

A3-6 

A3-24 

P2-7 

P2-27 

P5-10 

P5-28 



27.75 
43.31 
16.75 
30.05 
35.02 
39.89 
27.34 
33.65 



68.53 
509 3S 

62.88 
578.64 

70.13 
320.47 
144.8S 
462.29 



61.08 
53.50 
39.33 
67.72 
54.63 
65.10 
61.95 
72.39 



138.18 
93.86 
165.57 
140.25 
121.09 
61.13 
165.54 
104.61 



0.45 
0.81 
0.43 
0.45 
0.64 
0.61 
0.44 
0.46 



0.50 
5.43 
0.38 
3.21 
0.58 
5.25 
0.87 
4.43 



(ss) Single-stranded. The fluorescence emissions at S18 or 582 nm for solutions containing a final 
concentration of 50 nM indicated probe, 10 mM Tris-Hd (pH 8.3), 50 mM KCI, and 10 mM MgCl 2 . 
(ds) Double-stranded. The solutions contained, in addition, 100 nM A1C for probes Al-7 and 
Al-26, 100 nM A3C for piobes A3-6 and A3-24, 100 n M P2C for probes P2-7 and P2-27. or 100 nM 
PSC for probes P5-10 and P5-28. Before the addition of MgCI* 120 jd of each sample was heated 
at 95°C for 5 min. Following the addition of 80 »il of 25 mM MgCi 2 , each sample was allowed to 
cool to room temperature and the fluorescence emissions were measured. Reported values are 
the average of three determinations. 



matters for quenching is not the average 
distance between 6-FAM and TAMRA 
but, rather, how close TAMRA can get to 
6-FAM during the lifetime of the 6-FAM 
excited state. As long as the decay time of 
the excited state is relatively long com- 
pared with the molecular motions of the 
oligonucleotide, quenching can occur. 
Thus, we propose that TAMRA at the 3' 
end, or any other position, can quench 
6-FAM at the 5' end because TAMRA is in 
proximity to 6-FAM often enough to be 
able to accept energy transfer from an 
excited 6-FAM. 

Details of the fluorescence measure- 
ments remain puzzling. For example, Ta- 
ble 3 shows that hybridization of probes 
Al-26, A3-24, and P5-28 to their comple- 
mentary strands not only causes a large 
increase in 6-FAM fluorescence at 518 
nm but also causes a modest increase in 
TAMRA fluorescence at 582 nm. If 
TAMRA is being excited by energy trans- 
fer from quenched 6-FAM, then loss of 
quenching attributable to hybridization 
should cause a decrease in the fluores- 
cence emission of TAMRA. The fact that 
the fluorescence emission of TAMRA in- 
creases indicates that the situation is 
more complex. For example, we have an- 
ecdotal evidence that the bases of the 
oligonucleotide, especially G, quench 
the fluorescence of both 6-FAM and 
TAMRA to some degree. When double- 
stranded, base-pairing may reduce the 
ability of the bases to quench. The pri- 
mary factor causing the quenching of 
6-FAM in an intact probe is the TAMRA 
dye. Evidence for the importance of 
TAMRA is that 6-FAM fluorescence 
remains relatively unchanged when 
probes labeled only with 6-FAM are used 
in the 5' nuclease PCR assay (data not 
shown). Secondary effectors of fluores- 
cence, both before and after cleavage of 
the probe, need to be explored further. 

Regardless of the physical mecha- 
nism, the relative independence of posi- 
tion and quenching greatly simplifies 
the design of probes for the 5' nuclease 
PCR assay. There are three main factors 
that determine the performance of a 
double-labeled fluorescent probe in the 
5' nuclease PCR assay. The first factor is 
the degree of quenching observed in the 
intact probe. This is characterized by the 
value of RQ" , which is the ratio of re- 
porter to quencher fluorescent emis- 
sions for a no template control PCR. In- 
fluences on the value of RQ" include 
the particular reporter and quencher 
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FIGURE 3 Effect of Mg 2 * concentration on RQ ratio for the Al series of probes. The fluorescence 
emission intensity at 518 and 582 nm was measured for solutions containing 50 nM probe, 10 mM 
Tris-HCI {pH 8.3), 50 mM KC1, and varying amounts (O-10 mM) of MgQ z . The calculated RQ 
ratios (518 nm intensity divided by 582 nm intensity) are plotted vs. MgCl 2 concentration (dim 
Mg). The key (upper right) shows the probes examined. 



dyes used, spacing between reporter and 
quencher dyes, nucleotide sequence 
context effects, presence of structure or 
other factors that reduce flexibility of 
the oligonucleotide, and purity of the 
probe. The second factor is the efficiency 
of hybridization, which depends on 
probe T m , presence of secondary struc- 
ture in probe or template, annealing 
temperature, and other reaction condi- 
tions. The third factor is the efficiency at 
which Taq DNA polymerase cleaves the 
bound probe between the reporter and 
quencher dyes. This cleavage is depen- 
dent on sequence complementarity be- 
tween probe and template as shown by 
the observation that mismatches in the 
segment between reporter arid quencher 
dyes drastically reduce the cleavage of 
probe. (1) 

The rise in RQ" values for the Al se- 
ries of probes seems to indicate that the 
degree of quenching is reduced some- 
what as the quencher is placed toward 
the 3' end. The lowest apparent quench- 
ing is observed for probe Al-19 (see Fig. 
3) rather than for the probe where the 
TAMRA is at the 3' end (Al-26). This is 
understandable, as the conformation of 
the 3' end position would be expected to 
be less restricted than the conformation 
of an internal position. In effect, a 
quencher at the 3 f end is freer to adopt 
conformations close to the 5' reporter 
dye than is an internally placed 
quencher. For the other three sets of 



probes, the interpretation of RQ~ values 
is less clear-cut. The A3 probes show the 
same trend as Al, with the 3' TAMRA 
probe having a larger RQ~ than the in- 
ternal TAMRA probe. For the P2 pair, 
both probes have about the same RQ" 
value. For the P5 probes, the RQ" for the 
3' probe is less than for the internally 
labeled probe. Another factor that may 
explain some of the observed variation is 
that purity affects the RQ~ value. Al- 
though all probes are HPLC purified, a 
small amount of contamination with 
unquenched reporter can have a large ef- 
fect on RQ~. 

Although there may be a modest ef- 
fect on degree of quenching, the posi- 
tion of the quencher apparently can 
have a large effect on the efficiency of 
probe cleavage. The most drastic effect is 
observed with probe Al-2, where place- 
ment of the TAMRA on the second nu- 
cleotide reduces the efficiency of cleav- 
age to almost zero. For the A3, P2, and P5 
probes, ARQ is much greater for the 3' 
TAMRA probes as compared with the in- 
ternal TAMRA probes. This is explained 
most easily by assuming that probes 
with TAMRA at the 3' end are more likely 
to be cleaved between reporter and 
quencher than are probes with TAMRA 
attached internally. For the Al probes, 
the cleavage efficiency of probe Al-7 
must already be quite high, as ARQ does 
not Increase when the quencher is 
placed closer to the 3' end. This illus- 



trates the importance of being able to 
use probes with a quencher on the 3' 
end in the 5' nuclease PCR assay. In this 
assay, an increase in the intensity of re- 
porter fluorescence is observed only 
when the probe is cleaved between the 
reporter and quencher dyes. By placing 
the reporter and quencher dyes on the 
opposite ends of an oligonucleotide 
probe, any cleavage that occurs will be 
detected. When the quencher is attached 
to an internal nucleotide, sometimes the 
probe works well (Al-7) and other times 
not so well (A3-6). The relatively poor 
performance of probe A3-6 presumably 
means the probe is being cleaved 3' to 
the quencher rather than between the 
reporter and quencher. Therefore, the 
best chance of having a probe that reli- 
ably detects accumulation of PCR prod- 
uct in the 5' nuclease PCR assay is to use 
a probe with the reporter and quencher 
dyes on opposite ends. 

Placing the quencher dye on the 3' 
end may also provide a slight benefit in 
terms of hybridization efficiency. The 
presence of a quencher attached to an 
internal nucleotide might be expected to 
disrupt base-pairing and reduce the T m 
of a probe. In fact, a 2°C-3°C reduction 
in T m has been observed for two probes 
with internally attached TAMRAs. (9) This 
disruptive effect would be minimized by 
placing the quencher at the 3' end. Thus, 
probes with 3' quenchers might exhibit 
slightly higher hybridization efficiencies 
than probes with internal quenchers. 

The combination of increased cleav- 
age and hybridization efficiencies means 
that probes with 3' quenchers probably 
will be more tolerant of mismatches be- 
tween probe and target as compared 
with internally labeled probes. This, tol- 
erance of mismatches can be advanta- 
geous, as when trying to use a single 
probe to detect PCR-amplified products 
from samples of different species. Also, it 
means that cleavage of probe during PCR 
is less sensitive to alterations In an- 
nealing temperature or other reaction 
conditions. The one application where 
tolerance of mismatches may be a disad- 
vantage is for allelic discrimination. Lee 
et al. (1> demonstrated that allele-specific 
probes were cleaved between reporter 
and quencher only when hybridized to a 
perfectly complementary target. This al- 
lowed them to distinguish the normal 
human cystic fibrosis allele from the 
AF508 mutant. Their probes had TAMRA 
attached to the seventh nucleotide from 
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the 5' end and were designed so that any 
mismatches were between the reporter 
and quencher. Increasing the distance 
between reporter and quencher would 
lessen the disruptive effect of mis- 
matches and allow cleavage of the probe 
on the incorrect target. Thus, probes 
with a quencher attached to an' Internal 
nucleotide may still be useful for allelic 
discrimination. 

In this study loss of quenching upon 
hybridization was used to show that 
quenching by a 3' TAMRA is dependent 
on the flexibility of a single-stranded oli- 
gonucleotide. The increase in reporter 
fluorescence intensity, though, couid 
also be used to determine whether hy- 
bridization has occurred or not. Thus, , 
oligonucleotides with reporter and 
quencher dyes attached at opposite ends 
should also be useful as hybridization 
probes. The ability to detect hybridisa- 
tion In real time means that these probes 
could be used to measure hybridization 
kinetics. Also, this type of probe could be 
used to develop homogeneous hybrid- 
ization assays for diagnostics or other ap- 
plications. Bagwell et al.° 0) describe Just 
this type of homogeneous assay where 
hybridization of a probe causes an in- 
crease in fluorescence caused by a loss of 
quenching. However, they utilized a 
complex probe design that requires add- 
ing nucleotides to both ends of the 
probe sequence to form two imperfect 
hairpins. The results presented here 
demonstrate that the simple addition of 
a reporter dye to one end of an oligonu- 
cleotide and a quencher dye to the other 
end generates a fluorogenic probe that 
can detect hybridization or PCR amplifi- 
cation. 
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Wc have developed a novel "real time" quantitative PCR method. The method meaoirec PCR prodnrt 
accumulation through a duaUabeled fluoroeenlc probe (Lc, TaqMan Proba). This method provides very 
accurate and reproducible quantitation oF gene copies. Unlike other quantitative PCR methods, real-time rai 
does nor require post-PCR sample handling* preventing potential PCR product carry-over contamination and 
resulting In much raster and higher throughput assays. The real-time PCR method has a very large dynamic 
ranee of starting tarter molecule determination (at least five orders of magnitude). Real-time quantitative 
PCR is extremely accurate and less labor-intensive than current quantitative PCR methods. 



Quantitative nucleic acid sequence analysis Mas 
had an important role in many fields of biologi- 
cal research. Measurement of gent expression 
(RNA) has been used extensively In monitoring 
biological responses to various stimuli (Tan et al, 
1994; Huanft el at. I995a,b; Prud'hommc et al. 
1995). Quantitative gen* analysis (UNA) has 
Ih-ch used to determine the genome quantity of a 
particular gene, as in the case of the human HKK2 
gene, which Is amplified in -30% of breast tu- 
mors (Slarnon ct al. 1987). Gene and genome 
quantitation (DNA and UNA) also have been used 
for analysis of human immunodeficiency virus 
(ilJV) buTdcn demonstrating changes in the lev- 
els of virus throughout the different phases of the 
disease (Connor et al. 1993; Platak el al. JVv:*b; 
J ; urtado et al. 1995). 

Many methods have been described for the. 
quantitative analysis ot nucleic acid sequences 
(both fur RNA and DNA; Southern l y/*>; Sharp ct 
al. 1980; Thomas 1980). Recently, PCR ha* 
proven to be a powerful tool for quantitative 
nucleic acid analysis. PCR and reverse transcrip- 
tase (KT)-PCR have permitted the analysis of 
minimal starting quantities of nucleic acid (as 
little as one cell equivalent). This has mode i>os- 
siblc many experiments that could not hove, been 
performed with traditional methods. Although 
PCR has provided a powerful tool, it is imperative 



r to m 



thai it be uoeU properly for quantitation (tta«y. 
maekers 1995), Many early reports of quantita- 
tive: PCR and RT-PCR described quantitation of 
the VCR product but did not measure, the initial 
target sequence quantity. II is essential to design 
proper controls for the quantitation of the initial 
target sequences (Pcrrc 1992; Clement I ct al. 
190?.) 

KesMi'chcrs have, developed several methods 
of quantitative PCR and RT-PCR. One approach 
measures rCR product quantity in the log phase 
of the read ion before the plateau (Kellogg et al. 
1990; Pang et a), 1990). This method requires 
thai each sample has equal input amounts of 

■ nucleic add and that each sample under analysis 
amplifies with kicnUc.il efficiency up to the. point 
of quantitative analysis. A gene sequence (con- 
tained in wl I samples at relatively constant quan- 
til'itf*, such as p-aclln) can be used for sample 
amplification efficiency normateation. Uslny 
conventional methods of PCR detection and 
quantitation (gel electrophoresis or plate capture 
hybridization), it is extremely laborious to assure 
that all samples are analyzed during the log phase 
of the reaction (for both the target gene and the 
normalization gene), Another method, quantita- 
tive competitive (QC)-PCH, has been developed 
and is used widely for PCR quantitation. QC-PCR 
relics on the inclusion of an internal control 
competitor in each reaction (Becker-Andre 1991; 
Hatak et al, 1993*,1>). The efficiency of each re- 
action is normalised to the internal competitor. 
a imnwn auuiuni of lnt«*.maJ competitor can be 

annrv 7nc« new wj RCifrT 7nn7/cn/7T 



From I BML 



PHONE No. : 310 472 090S 



Dec. 05 2002 12:21AM P12 



ULAL 1IML OUANIUATIVI Kltf 



v 



added to oach sample. To obtain relative quant* 
ration, 1hc unknown target PGR product is com- 
pared with the known competitor K":k product, 
.Success of a quantitative competitive I'CU assay 
relies on developing an Internal control (hot am- 
piinv* with the same efficiency as ilic ungei 
eculc. The design of the coiiipethoi and the vali- 
dation of amplification cfhcieiieiea. icquirc a 
dedicated vffwl. However, because QCi-PCR does 
not require that POR pioducls be analyzed during 
the log phase of the amplification, it is the. easier 
uT the two methods to use. 

Severn I detection .system* uie used for quan 
tltative 1*CR and RT-1»Ctt analysis; (3) agarose 
gets, (2)iluur»<.vnl labeling of PCR products and 
detection wl(h lnawT-induccd fluoroseenec using 
capillary elitetTophoreriia (h'usco et ah 1995; Wil- 
liams ct at. 1 996) or acrylaiulde gel it, and (3) plate 
capture and sandwich probe hybridization (Mul- 
der et ah 1994). Although these method* pmved 
successful, each method requires post-J*CR ma- 
tlipularlons That add time to the analysis and 
may lead to laboi atuty i on lamination. The 
sample Lhiuughput uf these* method* i.% limited 
(wllh |htr exception of the plate capture ap- 
proach), and, therefore, these methods ore not 
well >uitr.U foi uj»o demanding bigh sain pie 
throughput (i.e., screening of large numbers of 

hU*tuwlci.u]<r:t ui diiaty/.lil£* SVuuplva fwx diagnu*' 
Ucs or clinical trials), 

1 lore we report the development of u novel 
assay for quantitative 1WA analysis. The assay is 
hiisisd on the us« of the 5' ' nuclease assay first 
described by Holland et al. (1993), The method 
uses the -*>' nucleate activity of 't\u{ polymerase to 
cleave a noncxtcndlblc hybrid! /.at ion probe dur- 
ing the extension phnse of I'CH- The approach 
uses dual-labeled fluorogenic hybridisation 
probes (Lcc ct a). 19^3; Jlussler el al. UvaU 
ct al, 199£a,b). One fluorescent dye serves as a 
reporter (FAM (i.e., G-carbaxyfluoresvcin)! and its 
emission spectra is quenched by the second fluo- 
rescent dye, TAMRA (i.ft., n-carboxy-ietramethyi- 
rhodaminc). The nuclease degradation of the hy- 
brtdi/atUm probe releases the quenching of ihe 
I 'AM fluorescent emission, resulting in an In- 
crease hi peak fluorescent emission at S1& n«n, 
Hie use Of a sequence detector (A151 Prism) allows 
measurement of fluorescent spectra of all 96 wells 
of the thermal cycler continuously during the 
1*CK amplification. Therefore, the reactions aje 
monitored in real time. T3)C out pm data is de- 
scribed and quantitative analysis of input target 
UNA sequences is discussed below. 



RESULTS 

PCR Produce Dereaion in R«<*1 Time 

The goal was to develop a high-throughput, sen- 
sitive, and nceuratc gene quantitation assay for 
use In monitoring lipid mediated therapeutic 
gene delivery. A plasinld encoding human factor 
Vlli gene sequence, pI'8TM (see Methods), was 
used as a model therapeutic gene. The assay usr* 
fluorescent Taqman methodology and an instru- 
ment capable of measuring fluorescence in real 
time (Alii Prism 77(H) Sequence TVtcrlnr). Hie 
Taqman reaction requires a hybridization probe 
Jalxlcd with two different fluorescent dyes. One 
dye is a reporter dy« (l-'AM), the other is X quench- 
ing dye (TAMRA), When the pmU: 1* inlacl, fluo- 
icsecnt energy transfer occurs and the reporter 
dye fluorescent emission is absorbed by the 
quenching dye (TAKfRA). During the extension 
phase of the TCK cycle, the nuoresccni hybrid- 
l/willoii prol>c U cleaved by t!ie 5'-.'*' nucleolytic 
activity of the. ON A polymerase. On cleavage of 
the probe, the reporter dye emission is no longer 
transferred efficiently to the quenching dye, re 
su J til ik hi ei n increase of the reporter dyu fluores- 
cent emi-^hj" *peetra. VCH primcr.s und probur* 
were designed ft" the human factor Vlli se- 
quence and human p-actln gene (a* dt^cril>ed in 
Methods). Optimization reactions were per- 
formed to choose the appropriate probe und 
•magnesium concentrations yielding ihe highest 
Intensity of rej^ortcr fluorescent signal without 
sacrificing specificity. The Instrument uses a 
ehargi>cowplcd device (i.e., CCD etunerii) for 
measuring the fluorescent emission apectrw from 
SOO t*i 6$0 nm. Kach VCAX tube was monitored 
sequentially for 2f> rn.scc with continuous moni- 
toring through out the amplification. ISftch tVtbc 
wa.% rr-cxan dried every B*5 see. Computer x>((- 
ware. was designed to examine the fluorescent in* 
tensity of both the reporter dye (FAM) and 
the quenching dye (TAMKA). The fluorescent 
intensity of the quenching dye, TAM11A, changes 
very Utile over the course of the PCR amplifi- 
cation (dattt not shown). Therefore, the intensity 
of TAMRA dye emission serves hs an internal 
.tlandard with which to normulbic the reporter 
dye (FAM) emission varindons. The software cal- 
culates a value termed ARn (or ARQ) using the 
following equation: ARn - (Un J ) (R«»')i where 
Hn 4 enitoNlun iulenshy of reporter/entission In- 
tensity of quencher at any given time In o-reae 
rion tube, and Ru r- emission intensitity of re- 
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ponCT/OmlS5i(7T) Jmcmily i?r qucnt:licr measured 

prior 10 TCK iimpliiication in Uiar same reaction 
tube. For the purpose of quantitation, the lasi 
three data points (ARns) cotieca-d during rhv. ex- 
tension step for each PCK cycle were analyzed. 
The nucleolytic degradation of the nyundi/ation. 
probe occurs during the extension phase or I'tzu, 
and, therefore, reporter fluorescent tniusMun in- 
creases during this time. 'Jin: tlnce data point* 
were averaged for each KJX cycle and the uk-hm 
value fur each was plotted in an "amplification 
plot" shown In J'ljjurc 3 A. The Attn mean value is 
plotted on the )A.axis, and time, represented by 
cyck- number, is plotted on thv*-axis. During the 
«ar]y cycle* of the FOR amplification, the ARn 



value remains at base line Whim sufficient hy- 
bridization probe \\U$ boon cleaved by the Ttu) 
l*>lymeruse nufloi/M activity, the intensity of re- 
porter fluorescent emission iiiereatiet.. Mod \K'A\ 
umplifioiljons reach u pl&ltao phaN<< of reporter 
fJuorcHC.nl omission if the reaeliun is carried nut 
lo high cycle nujnhvis- The amplification plot 3ft 
examined euily in th* reaction, ut a point lhai 
icpi-csents ilw log phase of product arnwmla- 
lion. This is done by us&ifinlng an aibiljury 
threshold thai is based on the variability of the 
base-line did*. In Figure 1 A, the threshold wax set 
in in standard deviation* above, the mean of 
haw? line emission calculated frwn tydea 1 lo 1 fv 
Once the threshold is chosen, the point at which 
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Figure 1 PCR product detection in real time. (-4) The Model 7700 *jUwan? will construct amplification plots 
from the extension phase fluorescent emission data collected during the PCR amplification. The standard de- 
viation is determined 1rom the data points collected from the base line of the amplification plot. .values are 
calculated by determining the point at which the fluorescence exceeds a threshold limit (usually 10 times the 
Standard deviation of the base line). (S) Overlay of amplification plots of serially (1:2) diluted human genomic 
DNA samples amplified with fi^ctin primers. (Q Input DNA concentration of the samples plotted versus t T . All 
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the amplification plot crossed the ihrctihold'-ivckt 
fined as C r . C r is reported us the cycle number ai 
this point. Ar will be demonstrated) (hi* C\ .value 
is piedieiive of ihc quantity of input hirget. 

Cj Values Provide a Quantitative Measurement 6*' 
Inpur Targer Sequences 

Figure IB shows amplification plot* of 1^ dirim- 
ent TCR amplifications overlaid. 'lT>c ampiifau* 
Hons wore performed on a 1:2 serial dilution 
human genomic DNA. *i*hc amplified targei w:u 
human p aetln, The amplification plotK Jihifi to 
the right (to higher threshold cycles) ns the. input 
target quantity is reduced. 'ITus is wpoctad b<!- 
ranutu nmctlortK with fewer starting enpicut of tlic 
target molecule require greater amplification to 
degrade enough probe to attain the threshold 
fluorescence. An arbitrary threshold of 10 stan- 
dard deviations above the base line was used to 
determine the G r values. Figure 10 represents the 
C T value* plotted versus the sample dilution 
value, Each dilution was amplified in triplicate 
! 3 c:r amplifications and plotted as mean values 
with error bars representing one standard devia- 
tion. The C T values decrease linearly with increas- 
ing target quantity, Thus, O r values can be used 
as fi quantitative measurement of the input target 
numbe.T. It should be noted that the amplifica- 
tion plot for the 15.6»ng sample shown 3n Figure 
1H does not reflect the snmc fluorescent rate of 
Increase exhibited by most of the other samples. 
The 15,6-ng sample also achieves cndpoinl pla- 
teau at a lower fluorescent vaJuc than would he 
expected based on the input PNA. This phenom- 
enon has been observed, occasionally with other 
samples (data not shown) and may be attribut- 
able to late cycle inhibition; this hypothesis is 
still under investigation. It is important to note 
lhat the flattened slope and early plateau do not 
impact significantly the calculated C, value as 
demonstrated by the fh on the line shown In 
Figure. 1 C. All triplicate amplifications resulted in 
very similar Cr values— the standard deviation 
did not exceed 0,5 for any dilution, this experi- 
ment contains a > 100,000-fold range of Input tar- 
get molecules. Using C v values for quantitation 
permits a much larger assay range than directly 
using total fluorescent emission intensity for 
quantitation. The linear range. oi iluoresccm in- 
tensity measurement of Hie ABI I'rlam 7700 $c- 

annrvr 
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mimts over n very large r^iiji*' nf relative starting" 
target quantities. 

Sample rYepararion Validation 

Several parameters influence the ofileU'wry uf 
FOR amplification; magnesium and sail concou? 
trations, reaction conditions (i.e., time and tem- 
perature), PCH target size and composition, 
primer sequences, and sample purity. Ail of The 
.above factors are common to a single VCR assay, 
except sample to sample purity, in an effort to 
validate the method of sample preparation for 
the lactor VJil assay, PCR amplification reproduc- 
rbihty and olficiency ol 10 replicate sample 
prejxarations were examined. After genomic DNA 
was prepared from the 10 replicate samples, the 
DNA wan quaiJlUatcd by ultraviolet spectroscopy, 
Amplifications were performed analyzing p-aciin 
gene content in 100 and 25 ng of total genomic 
UNA. Each I'CK amplification was performed in 
triplicate. Comparison of C r values for each trip, 
licate sample show 'minimal variation based on 
standard deviation and coefficient of variance 
(Tabic 1). Therefore, each ol the triplicate PCtt 
amplifications was highly reproducible, demon- 
strating that real time PCR using this instrumen- 
tation introduce* minimal variation into the 
quantitative PGR analysis. Comparison of the. 
mean C n values of the 10 replicate sample prepa- 
rations also showed minimal variability, indicat- 
ing that each sample preparation yielded similar 
results for p-nctln gene quantity. The highest C y 
difference between any of rhe samples was 0.85 
and 0.73 for the MX) and 25 ng samples, respec- 
tively. Additionally, the amplification of cad) 
sample exhibited an equivalent rate of fiuorev* 
cent emission intensity change per amount of 
DNA target analyzed as indicated by similar 
slopes derived from ihc sample dilutions (Fig. 2). 
Any sample containing an excess of a PC)i inhibi- 
tor would exhibit a greater measured 3-actln C r 
value for a given quantity of DNA. In addition, 
the inhibitor would be diluted along with (In; 
sample in the Uilu tUm analysis (Fig, 2), altering 
the expected C,* value change. Each sample am- 
plification yielded a similar result in the analysis, 
demons I rating" that this method of sample prepa- 
ration is highly reproducible with regard to 
sample purity. 

Quantitative Analvsis of a Plasm id After 

7 fl C « AQ/ «ff« W J ftCtfrT 70fl7/Cn/7T 
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Tablo 1 . Raproduclblllty of $*mpl« Preparation Method 
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0,17 
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0.15 
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(or containing a partial cUNA for human factor 
VIII, pl-'KTM. A scries of transfeciions was sot 
up using a decreasing amount of the plasinid\40, 
4, 0.5, and 0.1 jig). Tvvrniy-four hours po.si- 
trti i infection, total wn$ purified from each 

flask of eelb. p-Actiu gent quantity wa^ ehux*n as 
<t value for normal i/:at it *n of ^v.uuniic DNA con- 
centration frutn each sample. In this cxpetinivnt, 
(i-acun gene content should remain constant 
relative to rorat genomic DNA. Figure :t shows the 
result of the p-actin DNA measurement (100 jig 
total DNA determined by ultraviolet spectros- 
copy) of each summit. Kach sample was analyzed 
in triplicate and the mean p-aetin Or values of 
the triplicates were, plotted (error bars represent 

/-..^ «*-j-?#Urri H«\/iuiiom *l hp htflh**£T niffrrrnrr 



bt k tw<H-tn any two samplo moan is was 0.95 C,- Ten 
nanograms of total UNA of each sample were also 
examined for p-actin. The results again showed 
that very similar amounts of genomic DNA wore: 
present; 'the maximum mean |i actin <";, value 
difference waa 1.0. As Hgurc 3 shows, the rate of 
P-actln C r change between the 100 and 10-ng 
sujnnlc* was similar (sJojjc value;; rang« hwtwoen 
3,56 and -3,45). This verifies again thilt the 
method of .sample prcparailon yields sajinptas of 
identical PCR integrity G.c, no sample contained 
an excessive amuuul uf a PCR inhibitor). How- 
ever, these results indicate that each sample con- 
tained slight diffeienc.es in the actual amount of 
genomic DNA analysed. Determination of actual 
tfenumic DNA concentration was nccompUsbed 
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Figure 2 Sample preparation purity. The replicate 
camples shown In Table 1 wore also amplified .In 
tri pic-ate using 25 ng of each DNA sample. The fig. 
lii* shows the input DNA concentration (TOO and 
25 ng) vs. C, In ihf- 1irjur<*. ih*> 100 nnd 7.5 ng 
points for «ach tarn pie are connected by a line. 



hy plotting the mean (i-actin value obtained 
for each 100-ng saniplv oct a p-aclln standard 
i.-uive (shown In J'ig. 40)« The ocluaf genomic 
ONA concentred !<»» of c«ch simipk:, tt, was ob 
tallied by extrapolation to the X- 

Figure 4A shows the measured (S.ti,, iton- 
normalised) qu<uitilie=. uf /actor VJJJ plaamid 
ONA (pJWM) from each of tlu: four transient cell 
tm">fccUona. Each reaction contained 100 n£ of 
total sample? UNA (aa determined by UV spectros- 
copy). l r -ach sample was uualyzed in triplicate 
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Figure 3 Analysb of lianslectcd cdl DNA quantity 
and purity. I he DNA preparations of the lour 293 
cell transactions (40, 4, 0.5, and 0.1 p.g of pF8TM) 
were analyzed for the 0-actln gene. 100 and 10 ng 
(determined by ultraviolet spectroscopy) of each 
sample were amplified in triplicate, For each 
amount of pF8TM that was transacted, the {d-actln 
C T values are plotted versus rhe total Input DNA 



PC.r< amplifications. As shown, pl'SI'M purified 
•hoic Jhc 293 cells decreases (mean C ( values in- 
cuttuej with decreasing amounts <if plasmld 
firuit^lrUcd. The mc»« C A values obtained for 
pFBTW in Tigure 4A were pJotted on a standard 
curve comprised uf seihiHy diluted pKHTM, 
shown .in figure 4B. The quantity ui pi'tfi'M, b, 
found in each of the four tranKfections was do 
tcrmined by extrapolation to the x axis of the 
standard curve In l'igurc 4H, These uncorrected 
values, h, for plWM were nor mailed io deter- 
mine the actual amount of pl'8TM found per 100 
riK of genomic DNA by using Ihc equation:. 

b x 10 0 nft ycuial pl'fri*M copies per 
^ r 100 ng of genomic DNA 

where a actual genomic DNA in u sample and 
fi w pPHTM copies from the standard curve. The 
normalised quantity or pl'ftTM per 100 ng of ge- 
nomic DNA for each of the four Irnnsfectlom Is 
shown in Figure 4JJ. 'Ilie,se rr-iulla Ah cm that ihc 
Cjunntlty of factor Vill plasiuid associated wuh 
thO 2V3 cells, 24 hr after irunsfvMion, decu:.jses 
with decreasing plti>nild (..tinL.tnUiiitiau u.sed in 
the transection. The quantity of pJWM njwocJ- 
atcu with 293 celb, after transfectlon with 40 u-g 
Of ptaKiiiid, was 35 pgp^?r 100 ng jjBnoinlc DNA. 
TlUs results in -520 pi as mid copies per cell. 



WSCUSSION 

Wo have described a new method for qunntitnt- 
ing gene copy numbers using rcaMlmc anulysls 
of PCK amplification*. Real-time PCX ^ compat- 
ible with cither of the two PCR (HT-PCR) ap- 
proadied: (1) quantitative competitive wlicre an 
iiHcinid compcliioi' for each target sequence is 
used for norm all nation (data not shown) or (2) 
quantitative comparative PCH ush/g m ikhukiUzci- 
tion gene conlained within the sample (i.e., p-ac- 
tin) or a "housekeeping" gene for RT-VCK. Ff 
equal amounts of nucleic acid are analy/ed for 
eacn sample and if the amplification effkiency 
before quantitative analysb o identical for each 
sample, the lnreniai con I it jI (n^jmali^itioti gene 
or competit(3r) should give equal Mgnals for alJ 
samples. 

The real-time PCU method offers several ad- 
vantages over the other two methods currently 
employed (see the introduction). I**irst, the real- 
time l'CR method is performed in a doscd-tube 
system and requires no pwt-PCK. manipulation 

T /-» O OA* O*. (J TTV.T rtn*f»T '7T 
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Figure A Quantitative analyiii* of pFSTM in transfcctxrd cell*. (A) Amount of 
plasmid DNA used for the trunsfectlon plotted against Hit: huhim C, value deter- 
mined for pfSTM remaining fa fc r aflcr translection. (D,Q Standard curve* of 
pPfiTM 3nd fi-a<dn, respectively. pfftTM DNA (0) and genomic UNA (Q were 
diluted iftrUly 1 :S before amplification with the appropriate primers The p-actin 
standard curve waif used to normalise the results of /Uo 1 00 rtcj of genomic DNA. 
(0) The amount of pFSTM present p<:r 100 ng of genomic DNA. 



of .QinpKv Therefore, I lu* potential for PCll con- 
tamination in the laboratory is reduced because 
amplified productK cam l«» a indyred and disponed 
of without opening the reaction tubes. Second, 
this method suppotls ihv us« of a i iort i r«i 1j/:<it J ou 
«cnc (Lc., p-actin) for quantitative PGR or house- 
keeping genes for quantitative RT-i'Ck controls. 
Analysis is performed in real time during the Jog 
phase of product accumulation. Analysis during 
h>K phase permits many different genes (over a 
wide input tnr^t range) lo be analyzed simulta- 
neously, without concern of reaching rend ion 
plateau at different cycles This will make mull I- 
#gt\% analysis assays much casKn lw develop, be- 
cause individual internal uufipetliui> will nut be 
needed for coch gene under analysis. Third, 
sample throughput will iiH.jca.vc drttMialicedly 
with the new method because, there is no |>oAt- 
TCK processing time. Additionally, woiking In a 
96-wcll format h highly compatible with auto- 
mation technology. 

The real-time PCR method us highly repiru 
ducible. RcpMcaic amplifications can be analyzed 



for ?nch sample minimizing jxrtcntlal error. The 
sy siitin allow* i\>x a very large assay dynamic: 
range fuppToauliing 1,000,000- fold starting Uu- 
gei). Ue*ln« u .Niandord curve for the. target oi in- 
terest, relative copy number values can be deter- 
mined for any uukriuwji vuiuplv. Fluorescent 
threshold values, Cj V concJair. lifiearly with rela- 
tive DNA copy number.*. Heal time quantitative 
KT-rCJt methodology (Gibson et al., this l<uu<«) 
ha.1 alsoheen developed. Finally, real time quan- 
titative I*CU methodology can be used tu develop 
high-throughput screening assays for a variety of 
applications [quantitative gene cafnesoion (KT- 
PCft), gene copy nssaya (Mcr2, II1V, etc.)* £cn re- 
typing (knockout mouse analysis), and immuno- 

renj. • 

Real-time POU may al.w he ]>crforrncd using 
jntcrcnlalhig dyes (Hlguahi ei al. such as 

cfJiiditim bromide. The fluorogenic probe, 
method offers a major advantage over inter- 
calating dyes-- greater specificity (i.e., primer 
dtmvrs and .non sped fie PCR products are not de.- 
tftried). 
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MfcTHOlPS 

Generation of <t Piasmld Containing a Par-rial 
cDNA for Human Factor YIll 

ToWl RNA w<o harvested (UNAwd H-Owiu T<4 Tc«, Inc., 
Jwndiwood, TX) from ev)l> li-mfecled with a factor VI II 
expression vtwlar, pClSZttcWU (Katun el til.' 1VH6; Cor» 
mon ct al. 1900). A factor VH1 purtlat cl >N A Mvpirmv WAS 
ftt-iutrnicd by ItT l»C:1t (cJonoAmp \W. (Tlh UNA 1»f.U Xll 

(pan N8«K-cnyy, rtApphed isiosysicsns i : twtvi CMy, c;A)j 

using the I'CIt primers FHfor hiu! l-*Rrev <prin»-r srqueiirr* 
arc shown below). 1*1 ie ampHcon was reamplifird ittlnft 
modified MHo* aod Wrcv primers (uppciulvd with HamiU 
and HwdUl rvsi/lction site sequences «t iJiv v uulf and 
clonal i"t« ptiKM- 3Z (Pumic^u Corp., Mudixon, Wl). The 
resulting el wir, pPSTM, was used lor transient transfecdoti 
of i&W ecJK 



Amplification of Target DNA ami Duccibn of 
Amplicon Factor VIII Plasmid DNA 

(pFfJTM) was amplified, Willi \\tv |»liiuri* itifor S'-iXXl* 

<iTCi<;<^\ACiAU:j'CiAiXi(Cn'0.3' and Wrev />'-AAAc;crr- 
t^CCXrrOCiATO(j'rACiCi-rtM1i« rvni'llun pioduwd « 422- 
np K:k product. The forward primer was denned lu icv 
agnize u uiilipiv mmjuvhu* ft mi id In the .V untranslated 
region of tin; pa mil uC132.o\2S)> pi a, mi ml n * it! therefore 
Ouvi i tut kvwkoUu amplify the human foclor VIII 
£Vtk*„ Prim ore wore chosen with the swift* iicr* of I ho com. 
jmlcr program Olij;o (Nut im in I hi sciences, Inc., IMy- 
mouth, WN). The human |J-actl« gwic w «* amplified with 
llic primer* fS-tir»in forward primer S'TCACOCACAt TCT 
GCCCATCI7\f:OA-.V and fcJ-acthj reverse piiwcr V-( \A( I. 
CGCAACCX*CTiX:Anc<x;AATGG-3\ The reaction pro- 
duced a zys-np pCk product. 

Amplification reactions (SO (J) contained a DNA 
sample, H)x PCU Hufff.r II (S 200 *iM dATP, dCTl\ 
dGTP, and 400 p,M riUTP, 4 im< MgCl,, l.ZS Units Amph 
TiU) r;NA polymciasc, 0,5 unit AinpKrnvc uracil /v-jjty* 

i-uaylaav (UNO), £0 prnolv of CAch focloi VIII ftrlmci, unci 15 
pMioh* of uut^li |< Actio pilnicr. '/1i<r leactlwiv^ iiKo «:unltttncd 
One Of the fp Mowing detect Ion prill x*s (KMJ nw rrirli)* 

p»iirt»be A'(i'AW>Acunvrrc:cut:irrc;crn , (n w n , <:rcjT"- 

GCCTT(TAMRA)p 3' «ud p-«t-tin proU- 5 f (FAM)ATC3C3C:t^ 
■XCIj\MKA)CCCCCATCCCATCp-.T where p indicates 
pluMphoryl/dinn nnd X indicates a tinker arm nucleotide. 
RcacUcm (ulnr? wen* MicutiAmp Optieat 1\il>cs {part AtlrtV 
uerNKOI Pcridn lUniur) tlwt wore fro cu^d («t IVrhln 

F.lmcr) prvvc-id Kg Id from reflecting. *l\ibc capi w<«rc 
slmtl^i' m Mtt-roAiiip C^npn t>ut specially designed to pre- 
vent light scattering. .All ol tli<« PtlU iUni/*uiniihli-* were 
I^Kjd 1>y PK Applied ltiwyricm* {I'oalor Cllty, CX) except 
thr factor VIU ptUuen, which weie .tynthesl/rd at Cenvn 
tccli, Inc. (South f-wii Pranclsco, CA). Prohey wiw desjj;m'd 
u.^ng the Oli;;f.» 4.0 xoftwarc, folknving guUU'lIuot kuj;- 

^csichi in tnc Model 7 700 .sequence J>eta*u>r lii.viuoiiefil 
manual. Itrlcfly, probe T m ^muUl he Al least 5 U C hlfihrr 
Ulan me amu-ulluji iviu|Aialtirc u.tcd dufloj; Ihcnnul cy- 
rhngi primers should Jiot Xoioi Nt«*hle diiplexes' with (he 
probe. 

The thcnunl »*ycllng conditions lm-lutWd 2 jiiIji dt 
SOrC and 10 min at 95"C. 'Hiej-mal cycling procerdrd with 



reactiont; were performed io tho Modpl 77tllKScqucnce IX*- 
Wnior (Pt ApplU'd Uiosyslvuik), wldrh couuUiv m Ocne- 
Alop »»<":U Sy&tvni P«X>. lU:a*:llon cmidition^ w*-rr- pin- 
gruilllltua oo .i I'uwvr Mncintcwh 7100 (Apple Ovn pot r»r, 
Sflnta Clara, CW) linked dirvtiiy to the Model V'^OA .V- 
(juciicv JXtlffctor. Analy« u *» f daU w * i ,,lk; **■ l«^' ,fm '^ 
the M n l rah computer. Olloetloit and analydti cofiw;4ro 
wi\* develoiwl x\ IT\ Applies! Wosyntunis. 



Traiufection of Ceils with Factor VIII Comtrucl 

Pour T17.S flasks of 29^ cells <AT«: C\\\. 1573), a human 
fetal kidney su(ipen«ion cell line, wvre (frown to ROOfi con- 
llucrwy inc! transfceted pl ; rfi*M. Cells were growti in thv 
following media; S0% HAM'X HI 2 without GHT, WV> low* 
^ducose DnJlKCtNt's modified Kaj;lo maliunt (liMHM) wiUi» 
can filyone. wiUi sodium bicarbonate, J0% leml txmnc 
scnmi, 2 iiim L-gJutariiDr, and 1% penicjllin-strcptomy^ 
t|o. The media w«a vliaogcd 30 rnln l>rfo«* the Ironsfcc 
tion, pl : UTM DNA Amounts of 40, 4, OA, And 0.1 m\ were 
Milled U> ml of « sohitlon containing 0,12* m CUiCJ,.- 
And 1 X MWUiS. The four mixhm-i were left at room tein- 
pv.rm.irr f<« TO mln ai»d then iiddv*! dmpwL«- to die cells. 
'Hie n**>i* »vvi»- hiuul>ated al 37°C and 5% < :(X for 24 hr. 
washed with PUS, a>id Suspended Irt lMiS. Tin 1 k'hiim 
jA-ndvrd cella were divided into tdupmh uod UNA WA4 CV- 
tmtited luirncdlutcly luiiiR thv QIAump Ki»n«i Kit (Qfifjon. 
aiflUwvrtli, C*A>, !>NA vy».s clut<*d Into 200 P l ol 20 p.m 
TfU-UCl ut pll B.0, 
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ABSTRACT Wnt family members are critical to many 
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identification 
of two genes, WISP-1 and WISP-2, that are up-regulated in the 
mouse mammary epithelial cell line C57MG transformed by 
Wnt-1, but not by Wnt-4. Together with a third related gene, 
WISP-3, these proteins define a subfamily of the connective 
tissue growth factor family. Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1. These included (i) C57MG cells infected with a Wnt-1 
retroviral vector or expressing Wnt-1 under the control of a 
tetracyiihe repressive promoter, and («) Wnt-1 transgenic 
mice. The WISP-1 gene was localized to human chromosome 
8q24.1-8q24 J. WISP-1 genomic DNA was amplified in colon 
cancer cell lines and in human colon tumors and its RNA 
overexpressed (2- to >30-fold) in 84% of the tumors examined 
compared with patient-matched normal mucosa. WISP-3 
mapped to chromosome 6q22-6q23 and also was overex- 
pressed (4- to >40-fold) in 63% of the colon tumors analyzed. 
In contrast, WISP-2 mapped to human chromosome 20ql2- 
20ql3 and its DNA was amplified, but RNA expression was 
reduced (2- to > 30-fold) in 79% of the tumors. These results 
suggest that the WISP genes may be downstream of Wnt-1 
signaling and that aberrant levels of WISP expression in colon 
cancer may play a role in colon tumorigenesis. 



Wnt-1 is a member of an expanding family of cysteine-rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven-transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsh) to the cell membrane (1, 2, 6). Dsh then inhibits the 
kinase activity of the normally constttutively active glycogen 
synthase kinase-3/3 (GSK-30) resulting in an increase in 
0-catenin levels. Stabilized j3-catenin interacts with the tran- 
scription factor TCF/Lef 1, forming a complex that appears in 



The publication costs of this article were defrayed in part by page charge 
payment. This article must therefore be hereby marked "advertisement" in 
accordance with 18 U.S.C §1734 solely to indicate this fact. 
© 1998 by The National Academy of Sciences 0027-8424/98/95 1 47 17-6S2.00/0 
PNAS is available online at www.pnas.org. 



the nucleus and binds TCF/Lefl target DNA elements to 
activate transcription (7, 8). Other experiments suggest that 
the adenomatous polyposis coli (APC) tumor suppressor gene 
also plays an important role in Wnt signaling by regulating 
/3-catenin levels (9). APC is phosphorylated by GSK-3p, binds 
to 0-catenin, and facilitates its degradation. Mutations in 
either APC or /3-catenin have been associated with colon 
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1). 

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3, a member of 
the transforming growth factor (TGF)-/3 superfamily, and the 
homeobox genes, engrailed, goosecoid, twin (Xtwn), and siamo'is 
(2). A recent report also identifies c-myc as a target gene of the 
Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signal- 
ing pathway that are relevant to the transformed cell pheno- 
type, we used a PCR-based cDNA subtraction strategy, sup- 
pression subtractive hybridization (SSH) (11), using RNA 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this cell line is sufficient to induce a 
partially transformed phenotype, characterized by elongated 
and refractile cells that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two ceil lines might contribute 
to the transformed phenotype.- 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WISP- 1 
and WISP-2, and a third related gene, WISP-3, The WISP genes 
are members of the CCN family of growth factors, which 
includes connective tissue growth factor (CTGF), Cyr61, and 
nov, a family not previously linked to Wnt signaling. 

MATERIALS AND METHODS 

SSH. SSH was performed by using the PCR-Select cDNA 
Subtraction Kit (CLONTECH). Tester double-stranded 

Abbreviations: TGF, transforming growth factor; CTGF, connective 
tissue growth factor; SSH, suppression subtractive hybridization; 
VWC, von Willebrand factor type C module. 
Data deposition: The sequences reported in this paper have been 
deposited in the Genbank database (accession nos. AF100777, 
AF100778, AF100779, AF100780, and AF100781). 
tTo whom reprint requests should be addressed, e-mail: diane@gene. 
com. 
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cDNA was synthesized from 2 p,g of poly(A)"*" RNA isolated 
from the C57MG/Wnt-1 cell line and driver cDNA from 2 ptg 
of poly(A)* RNA from the parent C57MG cells. The sub- 
tracted cDN A library was subcloned into a pGEM-T vector for 
further analysis. 

cDNA Library Screening. Clones encoding full-length 
mouse WISP-1 were isolated by screening a AgtlO mouse 
embryo cDNA library (CLONTECH) with a 70-bp probe from 
the original partial clone 568 sequence corresponding to amino 
acids 128-169. Clones encoding full-length human WISP-1 
were isolated by screening AgtlO lung and fetal kidney cDNA 
libraries with the same probe at low stringency. Clones en- 
. coding full-length mouse and human WlSP-2 were isolated by 
screening a C57MG/Wnt-1 or human fetal lung cDNA library 
with a probe corresponding to nucleotides 1463-1512. Full- 
length cDNAs encoding WlSP-3 were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA, PCR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 of each dNTP at 
94°C for 1 sec, 62°C for 30 sec, 72°C for 1 min, for 22-32 cycles. 
WISP and glyceraldehyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Situ Hybridization. 33 P-labeled sense and antisense ribo- 
probes were transcribed from an 897-bp PCR product corre- 
sponding to nucleotides 601-1440 of mouse WISP-1 or a 
294-bp PCR product corresponding to nucleotides 82-375 of 
mouse WISP-2. All tissues were processed as described (40). 

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-amplified, and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation f luorimetry. Total RNA was prepared 
by homogenization in 7 M GuSCN followed by centrifugation 
over CsCl cushions or prepared by using RNAzol. 

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR. Gene-specific primers and 
fluorogenic probes (sequences available on request) were 
designed and used to amplify and quantitate the genes. The 
relative gene copy number was derived by using the formula 
2<a«) w here ACt represents the difference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphocyte DNA compared with colon tumor DNA or colon 
tumor RNA compared with normal mucosal RNA. The 
d-method was used for calculation of the SE of the gene copy 
number or RNA expression level. The WISP-specilic signal was 
normalized to that of the glyceraldehyde-3-phosphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-Elmer Applied Biosystems. 

RESULTS 

Isolation of WISP-1 and WISP-2 by SSH. To identify Wnt- 
1-inducible genes, we used the technique of SSH using the 
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mouse mammary epithelial cell line C57MG and C57MG cells 
that stably express Wnt-1 (11). Candidate differentially ex- 
pressed cDNAs (1,384 total) were sequenced. Thirty-nine 
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no match. To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/ Wnt-1 cells. 

Two of the cDNAs, WISP-1 and WISP-2, were differentially 
expressed, being induced in the C57MG/Wnt-1 cell line, but 
not in the parent C57MG cells or C57MG cells overexpressing 
Wnt-4 (Fig. 1 A and B). Wnt-4, unlike Wnt-1, does not induce 
the morphological transformation of C57MG cells and has no 
effect on 0-catenin levels (13, 14). Expression of WISP-1 was 
up-regulated approximately 3-fold in the C57MG/Wnt-1 cell 
line and WISP-2 by approximately 5-fold by both Northern 
analysis and reverse transcription-PCR. 

An independent, but similar, system was used to examine 
WISP expression after Wnt-1 induction. C57MG cells express- 
ing the Wnt-1 gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-I mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnt-l mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISPs may be an indirect response to Wnt-1 
signaling. 

cDNA clones of human WISP-1 were isolated and the 
sequence compared with mouse \yiSP-l. The cDNA sequences 
of mouse and human WISP-1 were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of ^40,000 (M T 40 K). Both have 
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosylation sites 
and are 84% identical (Fig. 24). 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of ^27,000 (M r 27 K) (Fig. IB). Mouse and human 
WISP-2 are 73% identical. Human WISP-2 has no potential 
N-linked glycosylation sites, and mouse WISP-2 has one at 

C57MQ 
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Fig. i. WISP-1 and WISP-2 are induced by Wnt-1, but not Wnt-4, 
expression in C57MG cells. Northern analysis of WISP-1 (A) and 
WISP-2 (B) expression in C57MG, C57MG/Wnt-1, and C57MG/ 
Wnt-4 cells. Poly(A) + RNA (2 /xg) was subjected to Northern blot . 
analysis and hybridized with a 70-bp mouse WISP- 1 -specific probe 
(amino acids 278-300) or a 190-bp WISP- 2- specific probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were re hybridized with 
human /3-actin probe. 
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Fig. 2. Encoded amino acid sequence alignment of mouse and 
human WISP-1 (A) and mouse and human W1SP-2 (B). The potential 
signal sequence, insulin-like growth factor-binding protein (IGF-BP), 
VWC, thrombospondin (TSP), and C-terminal (CT) domains are 
underlined. 

position 197. WISP-2 has 28 cysteine residues that are con- 
served among the 38 cysteines found in WISP-L 

Identification of WISPS, To search for related proteins, we 
screened expressed sequence tag (EST) databases with the 
WISP-1 protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3. A full-length human 
WISP-3 cDNA of 1,371 bp was isolated corresponding to those 
ESTs that encode a 354-aa protein with a predicted molecular 
mass of 39,293. WISP-3 has two potential N-linked glycosyi- 
ation sites and 36 cysteine residues. An alignment of the three 
human WISP proteins shows that WISP-1 and WISP-3 are the 
most similar (42% identity), whereas WISP-2 has 37% identity 
with WISP-1 and 32% identity with WISP-3 (Fig. 3/1). 

WISPs Are Homologous to the CTGF Family of Proteins. 
Human WISP-1, WISP-2, and WISP-3 are novel sequences; 
however, mouse WISP- 1 is the same as the recently identified 
Elm I gene. Elm I is expressed in low, but not high, metastatic 
mouse melanoma cells, and suppresses the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells (15). 
Human and mouse WISP-2 are homologous to the recently 
described rat gene, rCop-I (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov. CTGF is a chemotactic and mitogen ic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced byTGF-/3 (17). Cyr61 is an extracel- 
lular matrix signaling molecule that promotes cell adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overexpressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, . similarity to Wnt-1. All are 
secreted, cysteine-rich heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WISP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 3B) (21). The N-terminal domain, which includes the first 
12 cysteine residues, contains a consensus sequence (GCGC- 
CXXC) conserved in most insulin-like growth factor (IGF)- 
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Fig. 3. (A) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WISP-1 and WISP-2 that are not 
present in WISP-3 are indicated with a dot. (B) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical lines). The four cysteine residues in the VWC domain 
that are absent in WISP-3 are indicated with a dot. (C) Expression of 
WISP mRNA in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetal tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3, whereas WISP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagens and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only sue of the 10 cysteine residues (Fig. 3 A and B), 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated glycoconjugates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerization and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 3 A and B). The existence of a putative . 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown). 

Expression of WISP mRNA in Human Tissues. Tissue- 
specific expression of human WISPs was characterized by PCR 
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analysis on adult and fetal multiple tissue cDNA panels. 
WISP-l expression was seen in the adult heart, kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Little or no expression was detected in the brain, liver, skeletal 
muscle, colon, peripheral blood leukocytes, prostate, testis, or 
thymus. WISP-2 had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of WlSP-3 was seen in adult 
kidney and testis and fetal kidney. Lower levels of WISPS 
expression were detected in placenta, ovary, prostate, and 
small intestine. 

In Situ Localization of WISP-l and WISP-2. Expression of 
WISP-l and WISP-2 was assessed by in situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WISP-l was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig. 4 A-D). However, low- 
level WISP-l expression also was observed focally within tumor 
cells (data not shown). No expression was observed in normal 
breast. Like WISP-l, WISP-2 expression also was seen in the 
tumor stroma in breast tumors from Wnt-1 transgenic animals 
(Fig. 4E-H). However, WISP-2 expression in the stroma was 
in spindle-shaped cells adjacent to capillary vessels, whereas 




Fig. 4. (A,C, E, and G) Representative hematoxyiin/eosin-stained 
images from breast tumors in Wnt-1 transgenic mice. The correspond- 
ing dark-field images showing WISP-l expression are shown in B and 
D. The tumor is a moderately well-differentiated adenocarcinoma 
showing evidence of adenoid cystic change. At low power (A and B), 
expression of MSP-1 is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stromal(s) fibroblasts (C and O), and tumor cells are negative. 
Focal expression of WISP-l, however, was observed in tumor cells in 
some areas. Images of WISP-2 expression are shown in E-H. At low 
power (£ and F), expression of WISP-2 is seen in cells lying within the 
fibrovascular tumor stroma. At higher magnification, these cells 
appeared to be adjacent to capillary vessels whereas tumor cells are 
negative (G and //). 



the predominant cell type expressing WISP-l was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP-l is approximately 
3.48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 16.31] on chromosome 8q24.1 to 8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28). Preliminary Fine 
mapping indicates that WISP-l is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod = 1,000) on 
chromosome 20ql2-20ql3.1, Human WISPS mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM211ze5 
(lod = 1,000). WISPS is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (27, 29). 

Amplification and Aberrant Expression ofWISPs in Human 
Colon Tumors. Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression 
and poor prognosis (30). Because WISP-l resides in the same 
general chromosomal location (8q24) as c-myc, we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus. 
Genomic DNA from human colon cancer cell lines was 
assessed by quantitative PCR and Southern blot analysis. (Fig. 
5 A and B). Both methods detected similar degrees of WISP-l 
amplification. Most cell lines showed significant (2- to 4-fold) 
amplification, with the HT-29 and WiDr cell lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP-l locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PCR (Fig. 6). The copy number of WISP-l and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-l in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors (P < 0.001 for each). The 
copy number for WISPS was indistinguishable from one (P = 
0.166). In addition, the copy number of WISP-2 was signifi- 
cantly higher than that of WISP-l (P < 0.001). 

The levels of WISP transcripts in RNA isolated from 19 
adenocarcinomas and their matched normal mucosa were 




Fig. 5. Amplification of WISP-I genomic DNA in colon cancer cell 
lines. (A) Amplification in cell line DNA was determined by quanti- 
tative PCR. (B) Southern blots containing genomic DNA (10 /xg) 
digested with £coRI {WISP-I) or Xba\ (c-myc) were hybridized with 
a 100-bp human WISP-l probe (amino acids 186-219) or a human 
c-myc probe (located at bp 1901-2000). The WISP and myc genes are 
detected in normal human genomic DNA after a longer film exposure. 
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Fig. 6. Genomic amplification of WISP genes in human colon 
tumors. The relative gene copy number of the WISP genes in 25 
adenocarcinomas was assayed by quantitative PCR, by comparing 
DNA from primary human tumors with pooled DNA from 10 healthy 
donors. The data are means ± SEM from one experiment done in 
triplicate. The experiment was repeated at least three times. 

assessed by quantitative PCR (Fig. 7). The level of WISP-1 
RNA present in tumor tissue varied but was significantly 
increased (2- to >25-fold) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 10- fold overexpression. 
In contrast, in 79% (15/19) of the tumors examined, WlSP-2 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WISP-1, WISP-3 RNA was overexpressed in 
63% (12/19) of the colon tumors compared with the normal 
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Fig. 7. WISP RNA expression in primary human colon tumors 
relative to expression in normal mucosa from the same patient. 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative PCR. The Dukes stage of the tumor is listed under the 
sample number. The data are means ± SEM from one experiment 
done in triplicate. The experiment was repeated at least twice. 
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mucosa. The amount of overexpression of WISPS ranged from 
4- to >40-fold. 



DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy, SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
Wnt-1. 

Three of the genes isolated, WISP- 1, WISP-2, and WISP-3, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and nov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-1. 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of 
a tetracyline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors , induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnt-1 and WISPs in that in these two situations, 
WISP induction was correlated with Wnt-1 expression. 

It is not clear whether the WISPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., 0-catenin-TCF-l/Lefl). The increased levels of 
WISP RNA were measured in Wnt-l-transformed ceils, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through /3-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present in CTGF, Cyr61, nov, WISP-1, and WISP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such as TGF-/3, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WISP-1 and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or nov. A recent report has shown that integrin a v ft serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression of WISP-1 and WISP-2 in cells lying 
within the fibrovascular tumor stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
- tissue formation during wound repair. It has been proposed 
that mammary tumor cells or inflammatory cells at the tumor 
interstitial interface secrete TGF-/31, which is the stimulus for 
stromal proliferation (34). TGF-01 is secreted by a large 
percentage of malignant breast tumors and may be one of the 
growth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest that WISP-1 and WISP-2 expression was 
observed in the stromal cells that surrounded the tumor cells 
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(epithelial cells) in the Wnt-1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply WISP-1 and 
WISP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP-1 
and WISP-2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysis of WISP-1 gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas overexpression of 
WISP-3 RNA was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WISP-'2 was localized to chromosome 20ql2-20ql3, at a region 
frequently amplified and associated with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20ql3 amplicon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon. 

A recent manuscript on rCop-1, the rat orthologue of 
WISP-2 , describes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down -regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and /3-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic /3-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, the 
identification of WISPs as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development. . 

We thank the DNA synthesis group for oligonucleotide synthesis, T. 
Baker for technical assistance, P. Dowd for radiation hybrid mapping, 
K. Willert and R. Nusse for the tet-repressible C57MG/Wnt-1 cells, V. 
Dixit for discussions, and D. Wood and A Bruce for artwork. 

1. Cadigan, K. M. & Nusse, R. (1997) Genes Dev. 11, 3286-3305. 

2. Dale, T. C. (1998) Biochem. / 329, 209-223. 

3. Nusse, R. & Varmus, H. E. (1982) Cell 31, 99-109. 

4. van Ooyen, A. & Nusse, R. (1984) Cell 39, 233-240. 

5. Tsukamoto, A. S., Grosschedl, R:, Guzman, R. C, Parslow, T. & 
Varmus, H. E. (1988) Cell 55, 619-625. 

6. Brown, J. D. & Moon, R. T. (1998) Curr. Opin. Cell. Biol 10, 
182-187. 

7. Molenaar, M., van de Wetering, M., Oosterwegel, M., Peterson- 
Maduro, J., Godsave, S., Korinek, V., Roose, J., Destree, O. & 
Clevers, H. (1996) Cell 86, 391-399. 



Proc. Natl. Acad. Sci. USA 95 (1998) 

8. Korinek, V., Barker, N., Willert, FC, Molenaar, M., Roose, J., 
Wagenaar, G., Markman, M, Lamers, W., Destree, O. & Clevers, 
H. (1998) Mol Cell Biol 18, 1248-1256. 

9. Munemitsu, S., Albert, I., Souza, B., Rubinfeld, B. & Polakis, P. 

(1995) Proc. Natl Acad. Sci. USA 92, 3046-3050. 

10. He, T. C., Sparks, A. B„ Rago, C, Hermeking, H., Zawel, L, da 
, Costa, L. T, Morin, P. J., Vogeistein, B. & Kinzler, K. W. (1998) 

Science 281, 1509-1512. 

11. Diatchenko, L., Lau, Y. F., Campbell, A. P., Chenchik, A., 
Moqadam, F., Huang, B., Lukyanov, S., Lukyanov, K., Gurskaya, 
N„ Sverdlov, E. D. & Siebert, P. D. (1996) Proc. Natl. Acad. Sci. 

. USA 93, 6025-6030. 

12. Brown, A M., Wildin, R. S„ Prendergast, T. J. & Varmus, H. E. 
(1986) Ce//46, 1001-1009. 

13. Wong, G. T., Gavin, B. J. & McMahon, A. P. (1994) Mol CeU. 
Biol 14, 6278-6286. 

14. Shimizu, H., Julius, M. A., Giarre, M., Zheng, Z., Brown, A. M. 
& Kitajewski, J. (1997) CeU Growth Differ. 8, 1349-1358. 

15. Hashimoto, Y., Shindo-Okada, N., Tani, M., Nagamachi, Y.. 
Takeuchi, K., Shiroishi, T„ Toma, H. & Yokota, J. (1998)/. Exp. 
Med. 187, 289-296. 

16. Zhang, R*, Averboukh, L., Zhu, W., Zhang, H., Jo, H., Dempsey, 
P. J., Coffey, R. J M Pardee, A. B. & Liang, P. (1998) Mol. Cell. 
Biol. 18, 6131-6141. 

17. Grotendorst, G. R. (1997) Cytokine Growth Factor Rev. 8, 171- 
179. 

18. Kireeva, M. L., Mo, F. E., Yang, G. P. & Lau, L F. (1996) Mol. 
Cell Biol 16, 1326-1334. 

19. Babic, A. M., Kireeva, M. L., Kolesnikova, T. V. & Lau, L. F. 
(1998) Proc. Natl. Acad Sci. USA 95, 6355-6360. 

20. Martinerie, C, Huff, V., Joubert, I., Badzioch, M., Saunders, G., 
Strong, L. & Perbal, B. (1994) Oncogene 9, 2729-2732. 

21. Bork, P. (1993) FEBS Lett. 327, 125-130. 

22. Kim, H. S., Nagalla, S. R., Oh, Y., Wilson, E., Roberts, C. T., Jr. 
& Rosenfeid, R. G. (1997) Proc. Natl. Acad. ScL USA 94, 
12981-12986. 

23. Joliot, V., Martinerie, C, Dambrine, G., Piassiart, G., Brisac, M., 
Crochet, J. & Perbal, B. (1992) Mol Cell Biol 12, 10-21. 

24. Mancuso, D. J., Tuley, E. A, Westfield, L. A., Worrall, N. K., 
Shelton-Inloes, B. B., Sorace, J. M., Alevy, Y. G. & Sadler, J. E. 
(1989)7. Biol Chem. 264, 19514-19527. 

25. Holt, G. D., Pangburn, M. K. & Ginsburg, V. (1990) /. Biol 
Chem. 265, 2852-2855. 

26. Voorberg, J., Fontijn, R,, Calafat, J,, Janssen, H., van Mourik, 
J. A. & Pannekoek, H. (1991)/. CeU. Biol. 113, 195-205. 

27. Martinerie, C, Viegas-Pequignot, E., Guenard, I., Dutrillaux, B., 
Nguyen, V, C, Bernheim, A. & Perbal, B. (1992) Oncogene 7, 
2529-2534. 

28. Takahashi, E., Hori, T., O'Connell, P., Leppert, M. & White, R. 
(1991) Cytogenet. CeU. Genet. 57, 109-111. 

29. Meese, E., Meltzer, P. S., Witkowski, C. M. & Trent, J. M. (1989) 
Genes Chromosomes Cancer 1, 88-94. 

30. Carte, S. J. (1993) Crit. Rev. Oncog. 4, 435-449. 

31. Zhang, L., Zhou, W., Velculescu, V. E., Kern, S. E., Hruban, 
R. H., Hamilton, S. R., Vogeistein, B. & Kinzler, K. W. (1997) 
Science 276, 1268-1272. 

32. Sun, P. D. & Davies, D. R. (1995) Annu. Rex: Biophys. Biomol. 
Struct. 24, 269-291. 

33. Kireeva, M. L, Lam, S. C T. & Lau, L. F. (1998)/ Biol Chem. 
273, 3090-3096. 

34. Frazier, K. S. & Grotendorst, G. R. (1997) Int. J. Biochem. Cell 
Biol 29, 153-161. 

35. Wernert, N. (1997) Virchows Arch. 430, 433-443. 

36. Tanner, M. M., Tirkkonen, M., Kallioniemi, A., Collins, C, 
Stokke, T. t Karhu, R., Kowbel, D., Shadravan, F., Hintz, M., Kuo, 
W. L. et al. (1994) Cancer Res. 54, 4257-4260. 

37. Brinkmann, U., Gallo, M„ Polymeropoulos, Ml H. & Pastan, I. . 

(1996) Genome Res. 6, 187-194. 

- 38. Bischoff, J. R„ Anderson, L., Zhu, Y., Mossie, K., Ng, L., Souza, 
B., Schryver, B., Flanagan, P., Clairvoyant, F., Ginther, C, etal. 
(1998) EMBOl 17, 3052-3065. 

39. Morin, P. J., Sparks, A B., Korinek, V., Barker, N. t Clevers, H., 
Vogeistein, B. & Kinzler, K. W. (1997) Science 275, 1787-1790. 

40. Lu, L. H. & Gillett, N. (1994) CeU Vision 1, 169-176. 



letters to nature 



methods. Peptides AENK or AEQK were dissolved in water, made isotonic with 
NaCl and diluted into RPM1 growth medium. T-cell-proliferation assays were 
done essentially as described 10,21 . Briefly, after antigen pulsing (JOjigmr 1 
TTCF) with tetrapeptides (I-2mgmr'), PBMCs or EBV-B ceUs were 
washed in PBS and fixed for 45 s in 0.05% glutaraldehyde. Glycine was added 
to a final concentration of 0.1 M and the cells were washed five times in RPMI 
1640 medium containing 1% FCS before co-culture with T-ceU clones in 
round-bottom 96-well microtitre plates. After 48 h, the cultures were pulsed 
with 1 |xCi of 5 H-thymidine and harvested for scintillation counting 16 h later. 
Predigestion of native TTCF was done by incubating 200 u,g TTCF with 0.25 u.g 
pig kidney legumain in 500 u.1 50 mM citrate buffer, pH 5.5, for 1 h at 37 °C. 
Glyco peptide digestions. The peptides HIDMEEDI, HIDN(N-glucosamine) 
EEDI and H1DNESDI, which are based on the TTCF sequence, and 
QQQHLFGSNVTDCSGNFCLFR(KKK), which is based on human transferrin, 
were obtained by custom synthesis. The three C-terminal lysine residues were 
added to the natural sequence to aid solubility. The transferrin glycopeptide 
QQQHLFGSNVTDCSGNFCLFR was prepared by tryptic (Promega) digestion 
of 5mg reduced, carboxy- methylated human transferrin followed by 
concanavalin A chromatography". Glycopeptides corresponding to residues 
622-642 and 421-452 were isolated by reverse-phase HPLC and identified by 
mass spectrometry and N- terminal sequencing. The iyophilized transferrin- 
derived peptides were redissolved in 50 mM sodium acetate, pH 5.5, 10 mM 
dithiothreitol, 20% methanol. Digestions were performed for 3 h at 30 °C with 
5-50 mUml" 1 pig kidney legumain or B-cell AEP. Products were analysed by 
HPLC or MALDI-TOF mass spectrometry using a matrix of lOmgml -1 ct- 
cyanocinnamic acid in 50% acetonitrile/0. 1% TFA and a PerSeptive Biosystems 
Elite STR mass spectrometer set to linear or reflector mode. Internal standar- 
dization was obtained with a matrix ion of 568.13 mass units. 
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Fas ligand (FasL) is produced by activated T cells and natural 
killer cells and it induces apoptosis (programmed cell death) in 
target cells through the death receptor Fas/Apol/CD95 (ref. 1). 
One important role of FasL and Fas is to mediate immune- 
cytotoxic killing of cells that are potentially harmful to the 
organism, such as virus-infected or tumour cells 1 . Here we 
report the discovery of a soluble decoy receptor, termed decoy 
receptor 3 (DcR3), that binds to FasL and inhibits FasL-induced 
apoptosis. The DcR3 gene was amplified in about half of 35 
primary lung and colon tumours studied, and DcR3 messenger 
RNA was expressed in malignant tissue. Thus, certain tumours 
may escape FasL-dependent immune -cytotoxic attack by expres- 
sing a decoy receptor that blocks FasL. 

By searching expressed sequence tag (EST) databases, we identi- 
fied a set of related ESTs that showed homology to the tumour 
necrosis factor (TNF) receptor (TNFR) gene superfamily 2 . Using 
the overlapping sequence, we isolated a previously unknown full- 
length complementary DNA from human fetal lung. We named the 
protein encoded by this cDNA decoy receptor 3 (DcR3). The cDNA 
encodes a 300-amino-acid polypeptide that resembles members of 
the TNFR family (Fig. la): the amino terminus contains a leader 
sequence, which is followed by four tandem cysteine-rich domains 
(CRDs). Like one other TNFR homologue, osteoprotegerin (OPG) 3 , 
DcR3 lacks an apparent transmembrane sequence, which indicates 
that it may be a secreted, rather than a membrane-asscociated, 
molecule. We expressed a recombinant, histidine-tagged form of 
DcR3 in mammalian cells; DcR3 was secreted into the cell culture 
medium, and migrated on polyacrylamide gels as a protein of 
relative molecular mass 35,000 (data not shown). DcR3 shares 
sequence identity in particular with OPG (31%) and TNFR2 
(29%), and has relatively less homology with Fas (17%). All of 
the cysteines in the four CRDs of DcR3 and OPG are conserved; 
however, the carboxy- terminal portion of DcR3 is 101 residues 
shorter. 

We analysed expression of DcR3 mRNA in human tissues by 
northern blotting (Fig. lb). We detected a predominant 1.2-kilobase 
transcript in fetal lung, brain, and liver, and in adult spleen, colon 
and lung. In addition, we observed relatively high DcR3 mRNA 
expression in the human colon carcinoma cell line SW480. 

To investigate potential ligand interactions of DcR3, we generated 
a recombinant, Fc-tagged DcR3 protein. We tested binding of 
DcR3-Fc to human 293 cells transfected with individual TNF- 
family Iigands, which are expressed as type 2 transmembrane 
proteins (these transmembrane proteins have their N termini in 
the cytosol). DcR3-Fc showed a significant increase in binding to 
cells transfected with FasL/ (Fig. 2a), but not to cells transfected with 
TNF 5 , AP02L/TRAIL 6 * 7 , Apo3L/TWEAK 8,9 , or OPGL/TRANCE/ 
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RANKL 10 "' 2 (data not shown). DcR3-Fc immunoprecipitated shed 
FasL from FasL-transfected 293 cells (Fig. 2b) and purified soluble 
FasL (Fig. 2c), as did the Fc-tagged ectodomain of Fas but not 
TNFR1, Gel-filtration chromatography showed that DcR3-Fc and 
soluble FasL formed a stable complex (Fig. 2d). Equilibrium 
analysis indicated that DcR3-Fc and Fas-Fc bound to soluble 
FasL with a comparable affinity (K d = 0.8 ± 0.2 and 
U±0.lnM, respectively; Fig. 2e), and that DcR3-Fc could 
block nearly all of the binding of soluble FasL to Fas-Fc (Fig. 2e, 
inset). Thus, DcR3 competes with Fas for binding to FasL. 

To determine whether binding of DcR3 inhibits FasL activity, we 
tested the effect of DcR3-Fc on apoptosis induction by soluble 
FasL in Jurkat T leukaemia cells, which express Fas (Fig. 3a). DcR3- 
Fc and Fas-Fc blocked soluble- FasL-induced apoptosis in a 
similar dose-dependent manner, with half-maximal inhibition at 
—0.1 u.gmr'. Time-course analysis showed that the inhibition did 
not merely delay cell death, but rather persisted for at least 24 hours 
(Fig. 3b). We also tested the effect of DcR3-Fc on activation- 
induced cell death (A1CD) of mature T lymphocytes, a FasL- 
dependent process 1 . Consistent with previous results 13 , activation 
of interleukin-2-stimulated CD4-positive T cells with anti-CD3 
antibody increased the level of apoptosis twofold, and Fas-Fc 
blocked this effect substantially (Fig. 3c); DcR3-Fc blocked the 



induction of apoptosis to a similar extent. Thus, DcR3 binding 
blocks apoptosis induction by FasL. 

FasL-induced apoptosis is important in elimination of virus- 
infected cells and cancer cells by natural killer cells and cytotoxic T 
lymphocytes; an alternative mechanism involves perforin and 
granzymes 114 " 1 * 1 . Peripheral blood natural killer cells triggered 
marked cell death in Jurkat T leukaemia cells (Fig. 3d); DcR3-Fc 
and Fas-Fc each reduced killing of target cells from —65% to 
—30%, with half-maximal inhibition at -1 fig ml" 1 ; the residual 
killing was probably mediated by the perforin/granzyme pathway. 
Thus, DcR3 binding blocks FasL-dependent natural killer cell 
activity. Higher DcR3-Fc and Fas-Fc concentrations were required 
to block natural killer cell activity compared with those required to 
block soluble FasL activity, which is consistent with the greater 
potency of membrane-associated FasL compared with soluble 
FasL 17 . 

Given the role of immune-cytotoxic cells in elimination of 
tumour cells and the fact that DcR3 can act as an inhibitor of 
FasL, we proposed that DcR3 expression might contribute to the 
ability of some tumours to escape immune-cytotoxic attack. As 
genomic amplification frequently contributes to tumorigenesis, we 
investigated whether the DcR3 gene is amplified in cancer. We 
analysed DcR3 gene-copy number by quantitative polymerase chain 
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Figure 1 Primary structure and expression of human DcR3. a, Alignment of the 
amino-acid sequences of DcR3 and of osteoprotegerin (OPG); the C-terminal 101 
residues of OPG are not shown. The putative signal cleavage site {arrow}, the 
cysteine-r ich domains (CRD 1 -4), and the Ay-linked glycosylation site (asterisk) are 
shown, b. Expression of DcR3 mRNA. Northern hybridization analysis was done 
using the DcR3 cDNA as a probe and blots of poly(A}* RNA (Clontech) from 
human fetal and adult tissues or cancer cell lines. PBL, peripheral blood 
lymphocyte. 
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Figure 2 Interaction of DcR3 with FasL a, 293 cells were transfected with pRK5 
vector (top) or with pRK5 encoding full-length FasL (bottom), incubated with 
DcR3-Fc (solid line, shaded area), TNFR1 -Fc (dotted line) or buffer control 
(dashed line) (the dashed and dotted lines overlap), and analysed for binding by 
FACS. Statistical analysis showed a significant difference [P < 0.001 ) between the 
binding of DcR3-Fc to cells transfected with FasL or pRK5. PE, phycoerythrin- 
labelled celts, b. 293 cells were transfected as in a and metabolically labelled, and 
cell supernatants were immunoprecipitated with Fc-tagged TNFR1, DcR3 or Fas. 
c, Purified soluble FasL (sFasL) was immunoprecipitated with TNFRI-Fc, DcR3- 
Fc or Fas-Fc and visualized by immunoblot with anti-FasL antibody. sFasL was 
loaded directly for comparison in the right-hand lane, d, Flag-tagged sFasL was 
incubated with DcR3-Fc or with buffer and resotved by gel filtration; column 
fractions were analysed in an assay that detects complexes containing DcR3-Fc 
and sFasL-Flag. e, Equilibrium binding of DcR3-Fc or Fas-Fc to sFasL-Flag. 
Inset, competition of DcR3-Fc with Fas-Fc for binding to sFasL-Flag. 
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reaction (PCR) 18 in genomic DNA from 35 primary lung and colon 
tumours, relative to pooled genomic DNA from peripheral blood 
leukocytes (PBLs) of 10 healthy donors. Eight of 18 lung tumours 
and 9 of 17 colon tumours showed DcR3 gene amplification, 
ranging from 2- to 18-fold (Fig. 4a, b). To confirm this result, we 
analysed the colon tumour DNAs with three more, independent sets 
of DcR3 -based PCR primers and probes; we observed nearly the 
same amplification (data not shown). 

We then analysed DcR3 mRNA expression in primary tumour 
tissue sections by in situ hybridization. We detected DcR3 expres- 
sion in 6 out of 15 lung tumours, 2 out of 2 colon tumours, 2 out of 5 
breast tumours, and 1 out of 1 gastric tumour (data not shown). A 
section through a squamous-cell carcinoma of the lung is shown in 
Fig. 4c. DcR3 mRNA was localized to infiltrating malignant epithe- 
lium, but was essentially absent from adjacent stroma, indicating 
tumour-specific expression. Although the individual tumour speci- 
mens that we analysed for mRNA expression and gene amplification 
were different, the in situ hybridization results are consistent with 
the finding that the DcR3 gene is amplified frequently in tumours. 
SW480 colon carcinoma cells, which showed abundant DcR3 
mRNA expression (Fig. lb), also had marked DcR3 gene amplifica- 
tion, as shown by quantitative PCR (fourfold) and by Southern blot 
hybridization (fivefold) (data not shown). 

If DcR3 amplification in cancer is functionally relevant, then 
DcR3 should be amplified more than neighbouring genomic 
regions that are not important for tumour survival. To test this, 



we mapped the human DcR3 gene by radiation-hybrid analysis; 
DcR3 showed linkage to marker AFM218xe7 (Tl 60), which maps to 
chromosome position 20ql3. Next, we isolated from a bacterial 
artificial chromosome (BAC) library a human genomic clone that 
carries DcR3, and sequenced the ends of the clone's insert. We then 
determined, from the nine colon tumours that showed twofold or 
greater amplification of DcR3, the copy number of the DcR3- 
flanking sequences (reverse and forward) from the BAC, and of 
seven genomic markers that span chromosome 20 (Fig. 4d). The 
DcR3 -linked reverse marker showed an average amplification of 
roughly threefold, slightly less than the approximately fourfold 
amplification of DcR3; the other markers showed little or no 
amplification. These data indicate that DcR3 may be at the 'epi- 
centre' of a distal chromosome 20 region that is amplified in colon 
cancer, consistent with the possibility that DcR3 amplification 
promotes tumour survival. 

Our results show that DcR3 binds specifically to FasL and inhibits 
FasL activity. We did not detect DcR3 binding to several other TNF- 
Iigand- family members; however, this does not rule out the possi- 
bility that DcR3 interacts with other ligands, as do some other 
TNFR family members, including OPG 2J9 . 

FasL is important in regulating the immune response; however, 
little is known about how FasL function is controlled. One mechan- 
ism involves the molecule cFLIP, which modulates apoptosis signal- 
ling downstream of Fas 20 . A second mechanism involves proteolytic 
shedding of FasL from the cell surface 17 . DcR3 competes with Fas for 
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Figure 3 Inhibition of FasL activity by DcR3. a, Human Jurkat T leukaemia cells 
were incubated with Flag-tagged soluble FasL (sFasL:.5ngml"') oltgomerized 
with anti-Rag antibody (0.1 u.g ml -1 ) in the presence of the proposed inhibitors 
DcR3-Fc, Fas-Fc or human IgGl arid assayed for apoptosis (mean ± s.e.m. of 
triplicates), b. Jurkat cells were incubated with sFasL-Flag plus anti-Ffag antibody 
as in a, in presence of 1 ^9 ml -1 DcR3-Fc (rilled circles). Fas-Fc (open circles) or 
human IgGl (triangles), and apoptosis was determined at the indicated time 
points, c. Peripheral blood T cells were stimulated with PHA and interleukin-2. 
followed by control {white bars) or anti-CD3 antibody {filled bars), together with 
phosphate-buffered satine (PBS), human IgGl, Fas-Fc, or DcR3-Fc (lOn-gmr 1 ). 
After 16 h. apoptosis of CD4* cells was determined (mean ± s.e.m. of results from 
five donors), d, Peripheral blood natural killer cells were incubated with s, Cr- 
labelled Jurkat cells in the presence of DcR3-Fc (filled circles). Fas-Fc (open 
circles) or human IgGl (triangles), and target-cell death was determined by 
release of 5, Cr (mean ± s.d. for two donors, each in triplicate).- 
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Figure 4 Genomic amplification of DcR3 in tumours, a, Lung cancers, comprising 
eight adenocarcinomas (c, d. f. g. h, j, k, r), seven squamous-ce!l carcinomas {a. e, 
m, n, o, p, q), one non-small-cell carcinoma (b). one smail-cell carcinoma (i). and 
one bronchial adenocarcinoma (I). The data are means ± s.d. of 2 experiments 
done in duplicate, b, Colon tumours, comprising 17 adenocarcinomas. Data are 
means ± s.e.m. of five experiments done in duplicate, c. In situ hybridization . 
analysis of DcR3 mRNA expression in a squamous-cell carcinoma of the lung. A 
representative bright-field image (left) and the corresponding dark-field image 
(right) show DcR3 mRNA over infiltrating malignant epithelium (arrowheads). 
Adjacent non-malignant stroma (S). blood vessel {V) and necrotic tumour tissue 
(N) are also shown, d, Average amplification of DcR3 compared with amplifica- 
tion of neighbouring genomic regions (reverse and forward, Rev and Fwd). the 
DcR3-linked marker T160, and other chromosome-20 markers, in the nine colon 
tumours showing DcR3 amplification of twofold or more (b). Data are from two 
experiments done in duplicate. Asterisk indicates P < 0.01 for a Student's f-test 
comparing each marker with DcR3. 
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FasL binding; hence, it may represent a third mechanism of 
extracellular regulation of FasL activity. A decoy receptor that 
modulates the function of the cytokine interleukin-l has been 
described 21 . In addition, two decoy receptors that belong to the 
TNFR family, DcRl and DcR2, regulate the FasL-related apoptosis- 
inducing molecule Apo2L 22 . Unlike DcRl and DcR2, which are 
membrane-associated proteins, DcR3 is directly secreted into the 
extracellular space. One other secreted TNFR-family member is 
OPG\ which shares greater sequence homology with DcR3 (31%) 
than do DcRl (17%) or DcR2 (19%); OPG functions as a third 
decoy for Apo2L 19 . Thus, DcR3 and OPG define a new subset of 
TNFR-family members that function as secreted decoys to mod- 
ulate ligands that induce apoptosis. Pox viruses produce soluble 
TNFR homologues that neutralize specific TNF-family ligands, 
thereby modulating the antiviral immune response 2 . Our results 
indicate that a similar mechanism, namely, production of a soluble 
decoy receptor for FasL, may contribute to immune evasion by 
certain tumours. D 



Methods 

Isolation of DcR3 cDNA. Several overlapping ESTs in GenBank (accession 
numbers AA025672, AA025673 and W67560) and in Lifeseq™ (Incyte 
Pharmaceuticals; accession numbers 1339238, 1533571, 1533650, 1542861, 
1789372 and 2207027) showed similarity to members of the TNFR family. We 
screened human cDNA libraries by PCR with primers based on the region of 
EST consensus; fetal lung was positive for a product of the expected size. By 
hybridization to a PCR-generated probe based on the ESTs, one positive clone 
(DNA30942) was identified. When searching for potential alternatively spliced 
forms of DcR3 that might encode a transmembrane protein, we isolated 50 
more clones; the coding regions of these clones were identical in size to that of 
the initial clone (data not shown). 

Fc-fusion proteins (immunoadhesins). The entire DcR3 sequence, or the 
ectodomain of Fas or TNFR1, was fused to the hinge and Fc region of human 
IgGl, expressed in insect SF9 cells or in human 293 cells, and purified as 
described". 

Fluorescence-activated cell sorting (FACS) analysis. We trans fee ted 293 
cells using calcium phosphate or Effectene (Qiagen) with pRK5 vector or pRK5 
encoding full-length human VzsV (2 u-g), together with pRK5 encoding CrmA 
(2>xg) to prevent cell death. After 16 h, the cells were incubated with 
biotinylated DcR3-Fc or TNFRl-Fc and then with phycoerythrin-conjugated 
streptavidin (GibcoBRL), and were assayed by FACS. The data were analysed by 
Kolmogorov-Smirnov statistical analysis. There was some detectable staining 
of vector-transfected cells by DcR3-Fc; as these cells express little FasL (data 
not shown), it is possible that DcR3 recognized some other factor that is 
expressed constitutively on 293 cells. 

Immunoprecipitation. Human 293 cells were transfected as above, and 
metabolically labelled with [ 3S SJcysteine and | J5 S] methionine (0.5 mCi; 
Amersham). After 16h of culture in the presence of z-VAD-fmk (10u,M), 
the medium was immunoprecipitated with DcR3-Fc, Fas-Fc or TNFRl-Fc 
(5 u.g), followed by protein A-Sepharose (Repligen). The precipitates were 
resolved by SDS-PAGE and visualized on a phosphorimager (Fuji BAS2000). 
Alternatively, purified, Flag-tagged soluble FasL (1 p-g) (Alexis) was incubated 
with each Fc-fusion protein (1 p.g), precipitated with protein A-Sepharose, 
resolved by SDS-PAGE and visualized by immunoblotting with rabbit anti- 
FasL antibody (Oncogene Research). 

Analysis of complex formation. Flag-tagged soluble FasL (25 ^g) was 
incubated with buffer or with DcR3-Fc (40 p.g) for 1.5 h at 24 °C. The reaction 
was loaded onto a Superdex 200 HR 10/30 column (Pharmacia) and developed 
with PBS; 0.6 -ml fractions were collected. The presence of DcR3-Fc-FasL 
complex in each fraction was analysed by placing 100 u.1 aliquots into microtitre 
wells precoated with anti-human IgG (Boehringer) to capture DcR3-Fc, 
followed by detection with biotinylated anti-Flag antibody Bio M2 (Kodak) and 
streptavidin-horseradish peroxidase (Amersham). Calibration of the column 
indicated an apparent relative molecular mass of the complex of 420K (data not 
shown), which is consistent with a stoichiometry of two DcR3-Fc homodimers 
to two soluble FasL homotrimers. 

Equilibrium binding analysis. Microtitre wells were coated with anti-human 




IgG, blocked with 2% BSA in PBS. DcR3-Fc or Fas-Fc was added, followed by 
serially diluted Flag-tagged soluble FasL. Bound Iigand was detected with anti- 
Flag antibody as above. In the competition assay, Fas-Fc was immobilized as 
above, and the wells were blocked with excess IgGl before addition of Flag- 
tagged soluble FasL plus DcR3-Fc. 

T-cell AICD. CD3 + lymphocytes were isolated from peripheral blood of 
individual donors using anu-CD3 magnetic beads (Miltenyi Biotech), 
stimulated with phytohaemagglutinin (PHA; 2 u-g ml" 1 ) for 24 h, and cultured 
in the presence of interIeukin-2 ( 100 U ml" 1 ) for 5 days. The cells were plated in 
wells coated with anti-CD3 antibody (Pharmingen) and analysed for apoptosis 
16 h later.by FACS analysis of annexin-V-binding of CD4 + cells 2 *. 
Natural killer cell activity. Natural killer cells were isolated from peripheral 
blood of individual donors using anti-CD56 magnetic beads (Miltenyi 
Biotech), and incubated for 16 h with 51 Cr-loaded Jurkat cells at an effector- 
to-target ratio of 1:1 in the presence of DcR3-Fc, Fas-Fc or human IgGl. 
Target -cell death was determined by release of 51 Cr in effector- target co- 
cultures relative to release of 51 Cr by detergent lysis of equal numbers of Jurkat 
cells. 

Gene-amplification analysis. Surgical specimens were provided by J. Kern 
(lung tumours) and P. Quirke (colon tumours). Genomic DNA was extracted 
(Qiagen) and the concentration was determined using Hoechst dye 33258 
intercalation fluororaetry. Amplification was determined by quantitative PCR" 
using a TaqMan instrument (ABI). The method was validated by comparison of 
PCR and Southern hybridization data for the Myc and HER- 2 oncogenes (data 
not shown). Gene-specific primers and fluorogenic probes were designed on 
the basis of the sequence of DcR3 or of nearby regions identified on a BAC 
carrying the human DcR3 gene; alternatively, primers and probes were based 
on Stanford Human Genome Center marker AFM218xe7 (T160), which is 
linked to DcR3 (likelihood score = 5.4), SHGC-36268 (T159), the nearest 
available marker which maps to ~500 kilobases from T160, and five extra 
markers that span chromosome 20. The DcR3 -specific primer sequences were 
5'-CTTCTTCGCGCACGCTG-3' and 5'-ATCACGCCGGCACCAG-3' and the 
fluorogenic probe sequence was 5 ' - ( FAM - ACACGATGCGTGCTCCAAGCAG 
AAp-(TAMARA), where FAM is 5' -fluorescein phosphoramidite. Relative 
gene-copy numbers were derived using the formula 2 UCT \ where ACT is the 
difference in amplification cycles required to detect DcR3 in peripheral blood 
lymphocyte DNA compared to test DNA. 
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ABC transporters (also known as traffic ATPases) form a large 
family of proteins responsible for the translocation of a variety 
of compounds across membranes of both prokaryotes and 
eukaryotes 1 . The recently completed Escherichia coli genome 
sequence revealed that the largest family of paralogous £ coli 
proteins is composed of ABC transporters 2 . Many eukaryotic 
proteins of medical significance belong to this family, such as 
the cystic fibrosis transmembrane conductance regulator (CFTR), 
the P-glycoprotein (or multidrug-resistance protein) and the 
heterodimeric transporter associated with antigen processing 
(Tapl-Tap2). Here we report the crystal structure at 1.5 A resolu- 
tion of HisP, the ATP-binding subunit of the histidine permease, 
which is an ABC transporter from Salmonella typhimurium. We 
correlate the details of this structure with the biochemical, genetic 
and biophysical properties of the wild-type and several mutant 
HisP proteins. The structure provides a basis for understanding 
properties of ABC transporters and of defective CFTR proteins. 

ABC transporters contain four structural domains: two nudeo- 
tide-binding domains (NBDs), which are highly conserved 
throughout the family, and two transmembrane domains'. In 
prokaryotes these domains are often separate subunits which are 
assembled into a membrane-bound complex; in eukaryotes the 
domains are generally fused into a single polypeptide chain. The 
periplasmic histidine permease of 5. typhimurium and £ co/i M_s is a 
well-characterized ABC transporter that is a good model for this 
superfamily. It consists of a membrane-bound complex, HisQMP 2 , 
which comprises integral membrane subunits, HisQ and HisM, and 
two copies of HisP, the ATP-binding subunit. HisP, which has 
properties intermediate between those of integral and peripheral 
membrane proteins 9 , is accessible from both sides of the membrane, 
presumably by its interaction with HisQ and HisM 6 . The two HisP 
subunits form a dimer, as shown by their cooperativity in ATP 
hydrolysis 5 , the requirement for both subunits to be present for 
activity 8 , and the formation of a HisP dimer upon chemical cross- 
linking. Soluble HisP also forms a dimer 3 . HisP has been purified 
and characterized in an active soluble form 3 which can be recon- 
stituted into a fully active membrane-bound complex 8 . 

The overall shape of the crystal structure of the HisP monomer is 
that of an T with two thick arms (arm I and arm II); the ATP- 
binding pocket is near the end of arm I (Fig. 1). A six-stranded P- 
sheet (p3 and p8-p 12) spans both arms of the L, with a domain of a 
at- plus p-type structure (pi, p2, p4-[J7, al and ct2) on one side 
(within arm I) and a domain of mostly a-helices (a3-a9) on the 




Figure 1 Crystal structure of HisP. a, View of the dimer along an axis 
perpendicular to its two-fold axis. The top and bottom of the dimer are suggested 
to face towards the periplasmic and cytoplasmic sides, respectively (see text). 
The thickness of arm II is about 25 A. comparable to that of membrane. a-Helices 
are shown in orange and p-sheets in green, b, View along the two-fold axis of the 
HisP dimer, showing the relative displacement of the monomers not apparent in 
a. The p-strands at the dimer interface are labelled, c, View of one monomer from 
the bottom of arm I, as shown in a, towards arm II, showing the ATP-binding 
pocket, a-c, The protein and the bound ATP are in 'ribbon' and 'ball-and-stick' 
representations, respectively. Key residues discussed in the text are indicated in 
c. These figures were prepared with MOLSCRIPT 29 . N, amino terminus; C, C 
terminus. 
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Gene amplification is a common event in the progression of 
human cancers, and amplified oncogenes have been shown to 
have diagnostic, prognostic and therapeutic relevance. A 
kinetic quantitative polymerase-chain-reaction (PCR) method, 
based on fluorescent TaqMan methodology and a new instru- 
ment (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real-time, was used to quantify 
gene amplification in tumor DNA. Reactions are character- 
ized by the point during cycling when PCR amplification is still 
in the exponential phase, rather than the amount of PCR 
product accumulated after a fixed number of cycles. None of 
the reaction components is limited during the exponential 
phase, meaning that values are highly reproducible in reac- 
tions starting with the same copy number. This greatly 
improves the precision of DNA quantification. Moreover, 
real-time PCR does not require post-PCR sample handling, 
thereby preventing potential PCR-product carry-over con- 
tamination; it possesses a wide dynamic range of quantifica- 
tion and results in much faster and higher sample throughput. 
The real-time PCR method, was used to develop and validate 
a simple and rapid assay for the detection and quantification 
of the 3 most frequently amplified genes (myc, ccndl and 
erbBZ) in breast tumors. Extra copies of myc, ccndl and erbB2 
were observed in 10, 23 and 15%, respectively, of 108 breast- 
tumor DNA; the largest observed numbers of gene copies 
were 4.6, 18.6 and 15.1, respectively. These results correlated 
well with those of Southern blotting. The use of this new 
semi-automated technique will make molecular analysis of 
human cancers simpler and more reliable, and should find 
broad applications in clinical and research settings, Int. J. 
Cancer 78:661 -666, 1 998. 
© 1998 Wiley-Liss, Inc. 

Gene amplification plays an important role in the pathogenesis 
of various solid tumors, including breast cancer, probably because 
over-expression of the amplified target genes confers a selective 
advantage. The first technique used to detect genomic amplification 
was cytogenetic analysis. Amplification of several chromosome 
regions, visualized either as extrachromosomal double minutes 
(dmins) or as integrated homogeneously staining regions (HSRs), 
are among the main visible cytogenetic abnormalities in breast 
tumors. Other techniques such as comparative genomic hybridiza- 
tion (CGH) (Kallioniemi et a}., 1994) have also been used in broad 
searches for regions of increased DNA copy numbers in tumor 
cells, and have revealed some 20 amplified chromosome regions in 
breast tumors. Positional cloning efforts are underway to identify 
the critical gene(s) in each amplified region. To date, genes known 
to be amplified frequently in breast cancers include myc (8q24), 
ccndl (llql3), and erbhl (17ql2-q21) (for review, see Bieche and 
Lidereau, 1995). 

Amplification of the myc, ccndl, and erbB2 proto-oncogenes 
should have clinical relevance in breast cancer, since independent 
studies have shown that these alterations can be used to identify 
sub-populations with a worse prognosis (Bems et al., 1992; 
Schuuring et al, 1992; Slamon et aL 1987). Muss et at. (1994) 
suggested that these gene alterations may also be useful for the 
prediction and assessment of the efficacy of adjuvant chemotherapy 
and hormone therapy. 

However, published results diverge both in terms of the fre- 
quency of these alterations and their clinical value. For instance, 
over 500 studies in 10 years have failed to resolve the controversy 



surrounding the link suggested by Slamon et al (1987) between 
erb&2 amplification and disease progression. These discrepancies 
are partly due to the clinical, histological and ethnic heterogeneity 
of breast cancer, but technical considerations are also probably 
involved. 

Specific genes (DNA) were initially quantified in rumor cells by 
means of blotting procedures such as Southern and slot blotting. 
These batch techniques require large amounts of DNA (5-10 
ug/reaction) to yield reliable quantitative results. Furthermore, 
meticulous care is required at all stages of the procedures to 
generate blots of sufficient quality for reliable dosage analysis. 
Recently, PCR has proven to be a powerful tool for quantitative 
DNA analysis, especially with minimal starting quantities of tumor 
samples (smalt, early-stage rumors and formalin-fixed, paraffin- 
embedded tissues). 

Quantitative PCR can be performed by evaluating the amount of 
product either after a given number of cycles (end-point quantita- 
tive PCR) or after a varying number of cycles during the 
exponential phase (kinetic quantitative PCR). In the first case, an 
internal standard distinct from the target molecule is required to 
ascertain PCR efficiency. The method is relatively easy but implies 
generating, quantifying and storing an internal standard for each 
gene studied. Nevertheless, it is the most frequently applied 
method to date. 

One of the major advantages of the kinetic method is its rapidity 
in quantifying a new gene, since no internal standard is required (an 
external standard curve is sufficient). Moreover, the kinetic method 
has a wide dynamic range (at least 5 orders of magnitude), giving 
an accurate value for samples differing in their copy number. 
Unfortunately, the method is cumbersome and has therefore been 
rarely used. It involves aliquot sampling of each assay mix at 
regular intervals and quantifying, for each aliquot, the amplifica- 
tion product. Interest in the kinetic method has been stimulated by a 
novel approach using fluorescent TaqMan methodology and a new 
instrument (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real time (Gibson et aL, 1996; Heid et 
ai, 1996). The TaqMan reaction is based on the 5' nuclease assay 
first described by Holland et al. (1991). The latter uses the 5' 
nuclease activity ofTaq polymerase to cleave a specific fluorogenic 
oligonucleotide probe during the extension phase of PCR. The 
approach uses dual-labeled fluorogenic hybridization probes (Lee 
et al, 1993). One fluorescent dye, co-valently linked to the 5' end 
of the oligonucleotide, serves as a reporter [FAM (i.e., 6-carboxy- 
fluorescein)] and its emission spectrum is quenched by a second 
fluorescent dye, TAMRA (i.e., 6-carboxy-tetramethyl-rhodamine) 
attached to the 3' end. During the extension phase of the PCR 
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cycle, the fluorescent hybridization probe is hydrolyzed by the 
5'-3' nucleolytic activity of DNA polymerase. Nuclease degrada- 
tion of the probe releases the quenching of FAM fluorescence 
emission, resulting in an increase in peak fluorescence emission. 
The fluorescence signal is normalized by dividing the emission 
intensity of the reporter dye (FAM) by the emission intensity of a 
reference dye (i.e., ROX, 6-carboxy-X-rhodamine) included in 
TaqMan buffer, to obtain a ratio defined as the Rn (normalized 
reporter) for a given reaction tube. The use of a sequence detector 
enables the fluorescence spectra of all 96 wells of the thermal 
cycler to be measured continuously during PCR amplification. 

The real-time PCR method offers several advantages over other 
current quantitative PCR methods (Celi et al, 1994): (i) the 
probe-based homogeneous assay provides a real-time method for 
detecting only specific amplification products, since specific hybri- 
dation of both the primers and the probe is necessary to generate a 
signal; (ii) the C, (threshold cycle) value used for quantification is 
measured when PCR amplification is still in the log phase of PCR 
product accumulation. This is the main reason why C, is a more 
reliable measure of the starting copy number than are end-point 
measurements, in which a slight difference in a limiting component 
can have a drastic effect on the amount of product; (Hi) use of C t 
values gives a wider dynamic range (at least 5 orders of magni- 
tude), reducing the need for serial dilution; (iv) The real-time PCR 
method is run in a closed-tube system and requires no post-PCR 
sample handling, thus avoiding potential contamination; (v) the 
system is highly automated, since the instrument continuously 
measures fluorescence in all 96 wells of the thermal cycler during 
PCR amplification and the corresponding software processes, and 
analyzes the fluorescence data; (vi) the assay is rapid, as results are 
available just one minute after thermal cycling is complete; (vii) the 
sample throughput of the method is high, since 96 reactions can be 
analyzed in 2 hr. 

Here, we applied this semi-automated procedure to determine 
the copy numbers of the 3 most frequently amplified genes in breast 
tumors (myc, ccndl and erbB2), as well as 2 genes (alb and app) 
located in a chromosome region in which no genetic changes have 
been observed in breast tumors. The results for 108 breast tumors 
were compared with previous Southem-blot data for the same 
samples. 



MATERIAL AND METHODS 
Tumor and blood samples 

Samples were obtained from 1 08 primary breast tumors removed 
surgically from patients at the Centre Rene Huguenin; none of the 
patients had undergone radiotherapy or chemotherapy. Immedi- 
ately after surgery, the tumor samples were placed in liquid 
nitrogen until extraction of high-molecular- weight DNA. Patients 
were included in this study if the tumor sample used for DNA 
preparation contained more than 60% of tumor cells (histological 
analysis). A blood sample was also taken from 18 of the same 
patients. 

DNA was extracted from tumor tissue and blood leukocytes 
according to standard methods. 

Real-time PCR 

Theoretical basis. Reactions are characterized by the point 
during cycling when amplification of the PCR product is first 
detected, rather than by the amount of PCR product accumulated 
after a fixed number of cycles. The higher the starting copy number 
of the genomic DNA target, the earlier a significant increase in 
fluorescence is observed. The parameter C, (threshold cycle) is 
defined as the fractional cycle number at which the fluorescence 
generated by cleavage of the probe passes a fixed threshold above 
baseline. The target gene copy number in unknown samples is 
quantified by measuring C t and by using a standard curve to 
determine the starting copy number. The precise amount of 
genomic DNA (based on optical density) and its quality (i.e., lack 



of extensive degradation) are both difficult to assess. We therefore 
also quantified a control gene (alb) mapping to chromosome region 
4qll-ql3. in which no genetic alterations have been found in 
breast-tumor DNA by means of CGH (Kallioniemi et al, 1 994). 

Thus, the ratio of the copy number of the target gene to the copy 
number of the alb gene normalizes the amount and quality of 
genomic DNA. The ratio defining the level of amplification is 
termed "N*\ and is determined as follows: 

copy number of target gene (app, myc, ccndl, erbQ2) 

N = ■ . 

copy number of reference gene (alb) 

Primers, probes, reference human genomic DNA and PCR 
consumables. Primers and probes were chosen with the assistance 
of the computer programs Oligo 4.0 (National Biosciences, Ply- 
mouth, MN), EuGene (Daniben Systems, Cincinnati, OH) and Primer 
Express (Perkin-Elmer Applied Biosystems, Foster City, CA). 

Primers were purchased from DN Agency (Malvern, PA) and 
probes from Perkin-Elmer Applied Biosystems. 

Nucleotide sequences for the oligonucleotide hybridization 
probes and primers are available on request. 

The TaqMan PCR Core reagent kit, MicroAmp optical tubes, 
and MicroAmp caps were from Perkin-Elmer Applied Biosystems. 

Standard-curve construction. The kinetic method requires a 
standard curve. The latter was constructed with serial dilutions of 
specific PCR products, according to Piatak et al. (1993). In 
practice, each specific PCR product was obtained by amplifying 20 
ng of a standard human genomic DNA (Boehringer, Mannheim, 
Germany) with the same primer pairs as those used later for 
real-time quantitative PCR. The 5 PCR products were purified 
using MicroSpin S-400 HR columns (Pharmacia, Uppsala, Swe- 
den) electrophorezed through an acrylamide gel and stained with 
ethidium bromide to check their quality. The PCR products were 
then quantified spectro photometrically and pooled, and serially 
diluted 10-fold in mouse genomic DNA (Clontech, Palo Alto, CA) 
at a constant concentration of 2 ng/ul. The standard curve used for 
real-time quantitative PCR was based on serial dilutions of the pool 
of PCR products ranging from 10" 7 (I0 5 copies of each gene) to 
10" 10 (10 2 copies). This series of diluted PCR products was 
aliquoted and stored at -80°C until use. 

The standard curve was validated by analyzing 2 known 
quantities of calibrator human genomic DNA (20 ng and 50 ng). 

PCR amplification. Amplification mixes (50 ul) contained the 
sample DNA (around 20 ng, around 6600 copies of disomic genes), 
10X TaqMan buffer (5 ul), 200 uM dATP, dCTP, dGTP, and 400 
uM dUTP, 5 mM MgCl 2l 1.25 units of AmpliTaq Gold, 0.5 units of 
AmpErase uracil N-glycosylase (UNG), 200 nM each primer and 
100 nM probe. The thermal cycling conditions comprised 2 min at 
50°C and 10 min at 95 °C Thermal cycling consisted of 40 cycles at 
95°C for 15 s and 65°C for 1 min. Each assay included: a standard 
curve (from 10 5 to 10 2 copies) in duplicate, a no-template control, 
20 ng and 50 ng of calibrator human genomic DNA (Boehringer) in 
triplicate, and about 20 ng of unknown genomic DNA in triplicate 
(26 samples can thus be analyzed on a 96-well microplate). AH 
samples with a coefficient of variation (CV) higher than 1 0% were 
retested. 

All reactions were performed in the ABI Prism 7700 Sequence 
Detection System (Perkin-Elmer Applied Biosystems), which 
detects the signal from the fluorogenic probe during PCR. 

Equipment for real-time detection. The 7700 system has a 
built-in thermal cycler and a laser directed via fiber optical cables 
to each of the 96 sample wells. A charge-coupled-device (CDD) 
camera collects the emission from each sample and the data are 
analyzed automatically. The software accompanying the 7700 
system calculates Q and determines the starting copy number in the 
samples. 



GENE AMPLIFICATION BY REAL-TIME PCR 



Determination of gene amplification. Gene amplification was 
calculated as described above. Only samples with an N value 
higher than 2 were considered to be amplified. 

RESULTS 

To validate the method, real-time PCR was performed on 
genomic DNA extracted from 108 primary breast tumors, and 18 
normal leukocyte DNA samples from some of the same patients. 
The target genes were the myc, ccndl and erbBl proto-oncogenes, 
and the p-amyloid precursor protein gene (app), which maps to a 
chromosome region (21q21.2) in which no genetic alterations have 
been found in breast tumors (Kallioniemi et ai. 1994). The 
reference disomic gene was the albumin gene {alb, chromosome 
4qll-ql3). 
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Validation of the standard curve and dynamic range 
of real-time PCR 

The standard curve was constructed from PCR products serially 
diluted in genomic mouse DNA at a constant concentration of 
2 ng/ul. It should be noted that the 5 primer pairs chosen to analyze 
the 5 target genes do not amplify genomic mouse DNA (data not 
shown). Figure 1 shows the real-time PCR standard curve for the 
alb gene. The dynamic range was wide (at least 4 orders of 
magnitude), with samples containing as few as 10 2 copies or as 
many as 10 5 copies. 

Copy-number ratio of the 2 reference genes (app and albj 

The app to alb copy-number ratio was determined in 1 8 normal 
leukocyte DNA samples and all 108 primary breast-tumor DNA 
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Figure 1 - Albumin (alb) gene dosage by real-time PCR. Top: Amplification plots for reactions with starting alb gene copy number ranging 
from 10 5 (A9), 10 4 (A7), 10 3 (A4) to 10 2 (A2) and a no-template control (Al). Cycle number is plotted vs. change in normalized reporter signal 
(ARn). For each reaction tube, the fluorescence signal of the reporter dye (FAM) is divided by the fluorescence signal of the passive reference dye 
(ROX), to obtain a ratio defined as the normalized reporter signal (Rn). ARn represents the normalized reporter signal (Rn) minus the baseline 
signal established in the first 15 PCR cycles. ARn increases during PCR as alb PCR product copy number increases until the reaction reaches a 
plateau. Q (threshold cycle) represents the fractional cycle number at which a significant increase in Rn above a baseline signal (horizontal black 
line) can first be detected. Two replicate plots were performed for each standard sample, but the data for only one are shown here. Bottom: 
Standard curve plotting log starting copy number vs. C, (threshold cycle). The black dots represent the data for standard samples plotted in 
duplicate and the red dots the data for unknown genomic DNA samples plotted in triplicate. The standard curve shows 4 orders of linear dynamic 
range. 
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samples. We selected these 2 genes because they are located in 2 
chromosome regions (app. 21q21.2; alb, 4qll-ql3) in which no 
obvious genetic changes (including gains or losses) have been 
observed in breast cancers (Kallioniemi et ai, 1994). The ratio for 
the 18 normal leukocyte DNA samples fell between 0.7 and 1.3 
(mean 1.02 ± 0.21), and was similar for the 108 primary breast- 
tumor DNA samples (0.6 to 1.6, mean 1.06 ± 0.25), confirming 
that alb and app are appropriate reference disomic genes for 
breast-tumor DNA. The low range of the ratios also confirmed that 
the nucleotide sequences chosen for the primers and probes were 
not polymorphic, as mismatches of their primers or probes with the 
subject's DNA would have resulted in differential amplification. 

myc, ccndl and crbB2 gene dose in normal leukocyte DNA 

To determine the cut-off point for gene amplification in breast- 
cancer tissue, 18 normal leukocyte DNA samples were tested for 
the gene dose (N), calculated as described in "Material and 
Methods". The N value of these samples ranged from 0.5 to 1.3 
(mean 0.84 ± 0.22) for myc, 0.7 to 1.6 (mean 1.06 ± 0.23) for 
ccndl and 0.6 to 1.3 (mean 0.91 i 0. 19) for erbBl. Since N values 
for myc, ccndl and erbBl in normal leukocyte DNA consistently 
fell between 0.5 and 1.6, values of 2 or more were considered to 
represent gene amplification in tumor DNA. 

myc, ccndl and erbi?2 gene dose in breast-tumor DNA 

myc, ccndl and erbBl gene copy numbers in the 108 primary 
breast tumors are reported in Table I. Extra copies of ccndl were 
more frequent (23%, 25/108) than extra copies of erbBl (15%, 
16/108) and myc (10%, 11/108), and ranged from 2 to 18.6 for 
ccndl, 2 to 15.1 for erbBl, and only 2 to 4.6 for the myc gene. 
Figure 2 and Table II represent tumors in which the ccndl gene was 
amplified 16-fold (T145), 6-fold (T133) and non-amplified (TU8). 
The 3 genes were never found to be co-amplified in the same tumor. 
erbB2 and ccndl were co-amplified in only 3 cases, myc and ccndl 
in 2 cases and myc and erbBl in 1 case. This favors the hypothesis 
that gene amplifications are independent events in breast cancer. 
Interestingly, 5 tumors showed a decrease of at least 50% in the 
erbBl copy number (N < 0.5), suggesting that they bore deletions 
of the 17q21 region (the site of erbBl). No such decrease in copy 
number was observed with the other 2 proto-oncogenes. 

. Comparison of gene dose determined by real-time quantitative 
PCR and Southern-blot analysis 

Southem-blot analysis of myc, ccndl and erbBl amplifications 
had previously been done on the same 1 08 primary breast tumors. A 
perfect correlation between the results of real-time PCR and 
Southern blot was obtained for tumors with high copy numbers 
(N s 5). However, there were cases (1 myc, 6 ccndl and 4 erbBl) 
in which real-time PCR showed gene amplification whereas 
Southem-blot did not, but these were mainly cases with low extra 
copy numbers (N from 2 to 2.9). 

DISCUSSION 

The clinical applications of gene amplification assays are 
currently limited, but would certainly increase if a simple, standard- 
ized and rapid method were perfected. Gene amplification status 
has been studied mainly by means of Southern blotting, but this 
method is not sensitive enough to detect low-level gene amplifica- 
tion nor accurate enough to quantify the full range of amplification 
values. Southern blotting is also time-consuming, uses radioactive 



TABLE I - DISTRIBUTION OF AMPLIFICATION LEVEL (N) FOR myc. 
ccndl AND crbB2 GENES IN 108 HUMAN BREAST TUMORS 



Gene 




Amplification 


level (N) 




<0.5 


0.5-1.9 


2-4.9 




myc 


0 


97 (89.8%) 


11 (10.2%) 


0 


ccndl 


0 


83 (76.9%) 


17(15.7%) 


8 (7.4%) 


erbBl 


5 (4.6%) 


87 (80.6%) 


8 (7.4%) 


8 (7.4%) 



reagents and requires relatively large amounts of high-quality 
genomic DNA. which means it cannot be used routinely in many 
laboratories. An amplification step is therefore required to deter- 
mine the copy number of a given target gene from minimal 
quantities of tumor DNA (small early-stage tumors, cytopuncture 
specimens or formalin-fixed, paraffin-embedded tissues). 

In this study, wc validated a PCR method developed for the 
quantification of gene over-representation in rumors. The method, 
based on real-time analysis of PCR amplification, has several 
advantages over other PCR-based quantitative assays such as 
competitive quantitative PCR (Celi et ai, 1994). First, the real-time 
PCR method is performed in a closed-tube system, avoiding the 
risk of contamination by amplified products. Re-amplification of . 
carryover PCR products in subsequent experiments can also be 
prevented by using the enzyme uracil N-glycosylase (UNG) 
(Longo et ai, 1990). The second advantage is the simplicity and 
rapidity of sample analysis, since no post-PCR manipulations are 
required. Our results show that the automated method is reliable. 
We found it possible to determine, in triplicate, the number of 
copies of a target gene in more than 100 tumors per day. Third, the 
system has a linear dynamic range of at least 4 orders of magnitude, 
meaning that samples do not have to contain equal starting amounts 
of DNA. This technique should therefore be suitable for analyzing 
formalin- fixed, paraffin-embedded tissues. Fourth, and above all, 
real-time PCR makes DNA quantification much more precise and 
reproducible, since it is based on C, values rather than end-point 
measurement of the amount of accumulated PCR product. Indeed, 
the ABI Prism 7700 Sequence Detection System enables Q to be 
calculated when PCR amplification is still in the exponential phase 
and when none of the reaction components is rate-limiting. The 
within-run CV of the C t value for calibrator human DNA (5 
replicates) was always below 5%, and the between-assay precision 
in 5 different runs was always below 10% (data not shown). In 
addition, the use of a standard curve is not absolutely necessary, 
since the copy number can be determined simply by comparing the 
C t ratio of the target gene with that of reference genes. The results 
obtained by the 2 methods (with and without a standard curve) are 
similar in our experiments (data not shown). Moreover, unlike 
competitive quantitative PCR, real-time PCR does not require an 
internal control (the design and storage of internal controls and the 
validation of their amplification efficiency is laborious). 

The only potential disavantage of real-time PCR, like all other 
PCR-based methods and solid-matrix blotting techniques (South- 
ern blots and dot blots) is that is cannot avoid dilution artifacts 
inherent in the extraction of DNA from tumor cells contained in 
heterogeneous tissue specimens. Only FISH and immunohistochem- 
istry can measure alterations on a cell-by-cell basis (Pauletti et ai, 
1996: Slamon et ai, 1989). However, FISH requires expensive 
equipment and trained personnel and is also time-consuming. 
Moreover, FrSH does not assess gene expression and therefore 
cannot detect cases in which the gene product is over-expressed in 
the absence of gene amplification, which will be possible in the 
future by real-time quantitative RT-PCR. Immunohistochemistry is 
subject to considerable variations in the hands of different teams, 
owing to alterations of target proteins during the procedure, the 
different primary antibodies and fixation methods used and the 
criteria used to define positive staining. 

The results of this study are in agreement with those reported in , 
the literature, (i) Chromosome regions 4qll-ql3 and 21q21.2 
(which bear alb and app, respectively) showed no genetic alter- 
ations in the breast-cancer samples studied here, in keeping with 
the results of CGH (Kallioniemi et ai, 1994). (ii) We found that 
amplifications of these 3 oncogenes were independent events, as 
reported by other teams (Berns et ai, 1992; Borg et ai, 1992). (Hi) 
The frequency and degree of myc amplification in our breast tumor 
DNA series were lower than those of ccndl and erbBl amplifica- 
tion, confirming the findings of Borg et ai (1992) and Courjal etai 
(1997). (iv) The maxima of ccndl and erbBl over-representation 
were 1 8-fold and 1 5-fold, also in keeping with earlier results (about 
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23.2 
22.1 
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Figure 2 - ccndl and alb gene dosage by real-time PCR in 3 breast tumor samples: Tl 18 (El 2, C6, black squares), T1 33 (Gl 1, B4, red squares) 
and T145 (A8, C8, blue squares). Given the C, of each sample, the initial copy number is inferred from the standard curve obtained during the same 
experiment Triplicate plots were performed for each tumor sample, but the data for only one are shown here. The results are shown in Table II. 



30-fold maximum) (Bems et al. t 1 992; Borg et ai, 1 992; Courjal et 
ai, 1997). (v) The er6B2 copy numbers obtained with real-time 
PCR were in good agreement with data obtained with other 
quantitative PCR-based assays in terms of the frequency and 
degree of amplification (An et ai, 1995; Deng et at, 1996; Valeron 



et ai, 1996). Our results also correlate well with those recently 
published by Gelmini et ai (1997), who used the TaqMan system to 
measure er6B2 amplification in a small series of breast tumors 
(n = 25), but with an instrument (LS-50B luminescence spectrom- 
eter, Perkin-Elmer Applied Biosystems) which only allows end- 
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TABLE II - EXAMPLES OF ccndl GENE DOSAGE RESULTS 
FROM 3 BREAST TUMORS' 



Tumor 




ccndl 






alb 




hiccndj/alb 


Copy 
number 


Mean 


SD 


Copy 
number 


Mean 


SD 


T1I8 


4525 






4223 










4605 


4603 


77 


4365 


4325 


89 


1.06 




4678 






4387 








T133 


59821 






9787 










61659 


61100 


1111 


10092 


10137 


375 


6.03 




61821 






10533 








T145 


128563 






7321 










125892 


125392 


3448 


7762 


7672 


316 


16.34 




■121722 






7933 









'For each sample, 3 replicate experiments were performed and the mean 
and the standard deviation (SD) was determined. The level of ccndl gene 
amplification (Hccndllatb) is determined by dividing the average ccndl 
copy namber value by the average alb copy number value. 



point measurement of fluorescence intensity. Here we report myc 
and ccndl gene dosage in breast cancer by means of quantitative 
PCR. (vi) We found a high degree . of concordance between 
real-time quantitative PCR and Southern blot analysis in terms of 
gene amplification, especially for samples with high copy numbers 
(>5-fold). The slightly higher frequency of gene amplification 
(especially ccndl and erbB2) observed by means of real -time 
quantitative PCR as compared with Southern-blot analysis may be 
explained by the higher sensitivity of the former method. However, 
we cannot rule out the possibility that some tumors with a few extra 



gene copies observed in real-time PCR had additional copies of an 
arm or a whole chromosome (trisomy, tetrasomy or polysomy) 
rather than true gene amplification. These 2 types of genetic 
alteration (polysomy and gene amplification) could be easily 
distinguished in the future by using an additional probe located on 
the same chromosome arm, but some distance from the target gene. 
It is noteworthy that high gene copy numbers have the greatest 
prognostic significance in breast carcinoma (Borg et al, 1992; 
Slamon*/ a/., 1987). 

Finally, this technique can be applied to the detection of gene 
deletion as well as gene amplification. Indeed, we found a 
decreased copy number of erb&2 (but not of the other 2 proto- 
oncogenes) in several tumors; erbBl is located in a chromosome 
region (17q21) reported to contain both deletions and amplifica- 
tions in breast cancer (Bieche and Lidereau, 1995). 

In conclusion, gene amplification in various cancers can be used 
as a marker of pre -neoplasia, also for early diagnosis of cancer, 
staging, prognostication and choice of treatment. Southern blotting 
is not sufficiently sensitive, and FISH is lengthy and complex. 
Real-time quantitative PCR overcomes both these limitations, and 
is a sensitive and accurate method of analyzing large numbers of 
samples in a short time. It should find a place in routine clinical 
gene dosage. 
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